Gene NATL1_20161 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_20161 
Symbol 
ID4779553 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp1659082 
End bp1660233 
Gene Length1152 bp 
Protein Length383 aa 
Translation table11 
GC content43% 
IMG OID640085308 
Producthypothetical protein 
Protein accessionYP_001015836 
Protein GI124026721 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG2109] ATP:corrinoid adenosyltransferase 
TIGRFAM ID[TIGR00708] cob(I)alamin adenosyltransferase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.137745 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGTTTCAA ACGGTATAGG GATTACGACA GCATCGGAAA GCTTGGAACG TAGTCAAGGA 
CAATTACATG TTTATGACGG AGAGGGTAAA GGGAAGAGCC AGGCAGCCTT GGGGGTCGTT
CTGAGGACCA TAGGTCTAGG CATATGTGAG AAAAGACAAA CAAGGGTATT ACTTCTTAGA
TTCTTGAAAG GACCTGGTCG CTCGTATGAC GAAGATGCCG CAATAGATGC TTTGCAGCAA
GGCTTCCCTC ACTTGATTGA TCAAGTGAGG ACTGGCAGGG GAGAATTTTT TAGCGCCGAT
CAATCTACCA AATTTGATTA TCAGGAAGCT CAAAGAGGTT GGGACATAGC CAAGGGGGCA
ATCGCTAGTG CCTTGTATTC AGTTGTTGTC CTCGACGAAT TGAATCCTGT TCTGGATTTA
GGATTATTGC CTGTTGAAGA AGTTGTTAAA ACACTTAAGT CAAGACCAAA CGGTATGGAA
ATTATCGTTA CTGGAAGAGC TGCACCAAAT CCTCTGATTA AAGTTGCGGA ACTGCATTCT
GAGATGAGAG CTCACAGACG ACCTGAGATT AGTAACGATG AAATTCTTTT TGAAAATAAT
GTTGGTGGGA TTGAAATATA TACGGGTGAA GGAAAAGGCA AATCAACCAG TGCGTTGGGT
AAAGCTTTAC AAGCTATCGG TAGAGGAATA AGTCAGGACA AAAGTCATCG TGTTTTGATT
TTGCAATGGC TGAAGGGTGG TAGTGGTTAC ACAGAGGATG CCGCTATTGC GGCTCTTCGA
GAAAGTTATC CTCATTTAGT AGACCATCTT CGATCTGGTA GAGATGCGAT TGTTTGGAGG
GGCCAGCAAA AGCCCATTGA CTATGTAGAG GCTGAAAGAG CATGGGAAAT TGCAAGGGCA
GCTATTTCAA GTGGTCTTTA TAAGACTGTG ATTTTGGATG AGTTAAATCC AACCGTTGAT
TTGGAACTCC TCCCAGTTGA GCCTATTGTT CAAACATTGC TTCGTAAACC TTCCGAAACC
GAGGTGATTA TTACAGGAAG ATGCAAAAAC CAACCTATAT ATTTTGATTT AGCAAGTGTT
CATTCTGAGA TGGTGTGTCA CAAGCACTAT GCTGAAAAAG GAGTTGATTT AAAAAGGGGA
GTTGATTATT AG
 
Protein sequence
MVSNGIGITT ASESLERSQG QLHVYDGEGK GKSQAALGVV LRTIGLGICE KRQTRVLLLR 
FLKGPGRSYD EDAAIDALQQ GFPHLIDQVR TGRGEFFSAD QSTKFDYQEA QRGWDIAKGA
IASALYSVVV LDELNPVLDL GLLPVEEVVK TLKSRPNGME IIVTGRAAPN PLIKVAELHS
EMRAHRRPEI SNDEILFENN VGGIEIYTGE GKGKSTSALG KALQAIGRGI SQDKSHRVLI
LQWLKGGSGY TEDAAIAALR ESYPHLVDHL RSGRDAIVWR GQQKPIDYVE AERAWEIARA
AISSGLYKTV ILDELNPTVD LELLPVEPIV QTLLRKPSET EVIITGRCKN QPIYFDLASV
HSEMVCHKHY AEKGVDLKRG VDY