Gene NATL1_08621 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_08621 
Symbol 
ID4779332 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp792687 
End bp793934 
Gene Length1248 bp 
Protein Length415 aa 
Translation table11 
GC content29% 
IMG OID640084137 
Productglycosyltransferase-like protein 
Protein accessionYP_001014685 
Protein GI124025569 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.784838 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000054897 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTTAAAGA TTTGGATGAT AAATCAGTTT GCAAATACAC CTGATTTGCC AGGGCATACA 
AGAAACTACG AAGTTGCTAA ATATTTAGTT AAAAATGGAT GGAGAGTAGA TTTATTTGCA
TCTGACTTTA ATCTTAGTCA GAGAAAGTAT TGCAAGTTAA AGAAATTTGA ATTATTTAAA
ATCAATAAAA TTGATGGAAT AAAATGGCAT TGGTTAAGAG TATCTTCTTA TTCAATTAAT
AATTGGAAAA GATATCTAAA CATATTAAGT TTTTCAATTC ATATATTTTT ATTTCTTATA
TTAAAATCTA TTCTATCTTT AAAAAGAGAG CAATTACCAA ATATAATATT AGCTAGTTCT
CCTCAATTAC CAGCAGCATA TTTTTGCTTA ATAGTTTCAA AAATTCTCCA TATACCATTG
GTTTTAGAAA TAAGAGATCT ATGGCCACAA ATTCTTATTG ATCTAGGAGA TAAAAGCAAA
GAAAACATAT TATATAGGAT ATTATATTGG ATGGAGCTTT TATTATATAA GGAATCAAAA
ATTATAGTTA TTTTATCTAA AGGGTCAAAG GAACATGTAG TAAAAAAAGG AGGGAAACTT
ATAAAATGGT TACCAAATGG GCCAGATCTA AGTAAATTCA AATTTTCAAG TCTACCTAAT
GAAGAATTAA CTTTTTCTTT TAATAGACCC TTTAGATTGG TATATGCGGG AGCTCACAGC
CAAGTTAATG GTTTGATGTA CGTTTTGAAT GCTGCTAAGT TATTGATCAA TCACCCAATT
GAAATTACAT TGATAGGGGA TGGTCCAGAG AAAAATAATC TTGTTGAACA ATCAAAAAAA
CTTGCTTTAA ATAATGTAAA ATTTTTAAAA CCTCATTCTA AAGATAATAT TCCAAAAATT
CTATCTACAT TCGATGCAAT ATTACTTTCA CTAATAGATT CTGACCTATT TAGATATGGA
ATATCTCCCA ATAAGTTATA TGATGCTTAT GCTCTTGGAA GACCAGTAAT AACTACTGTC
CCAGGAATGA TAAATGATGA AGTAGAGTCA AATAAATTAG GGACTACTTC AAGAGCATGT
GATTCATCTT CATTATCATT AGCAATAAAA AGATTAATGA ACACATCAAG GAGAGATAGA
GAAATGATGG GGATCAGAGC TAGATCTATA GCCGAAAAAA CATATTCAAG AAATCGAATC
AACAAAGAAT ATGATAAACT TCTGCGCTCA TTAATACCTA ATGAATAA
 
Protein sequence
MLKIWMINQF ANTPDLPGHT RNYEVAKYLV KNGWRVDLFA SDFNLSQRKY CKLKKFELFK 
INKIDGIKWH WLRVSSYSIN NWKRYLNILS FSIHIFLFLI LKSILSLKRE QLPNIILASS
PQLPAAYFCL IVSKILHIPL VLEIRDLWPQ ILIDLGDKSK ENILYRILYW MELLLYKESK
IIVILSKGSK EHVVKKGGKL IKWLPNGPDL SKFKFSSLPN EELTFSFNRP FRLVYAGAHS
QVNGLMYVLN AAKLLINHPI EITLIGDGPE KNNLVEQSKK LALNNVKFLK PHSKDNIPKI
LSTFDAILLS LIDSDLFRYG ISPNKLYDAY ALGRPVITTV PGMINDEVES NKLGTTSRAC
DSSSLSLAIK RLMNTSRRDR EMMGIRARSI AEKTYSRNRI NKEYDKLLRS LIPNE