Gene NATL1_00341 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_00341 
SymbolcbiD 
ID4780273 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp33942 
End bp35087 
Gene Length1146 bp 
Protein Length381 aa 
Translation table11 
GC content37% 
IMG OID640083297 
Productcobalt-precorrin-6A synthase 
Protein accessionYP_001013863 
Protein GI124024747 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG1903] Cobalamin biosynthesis protein CbiD 
TIGRFAM ID[TIGR00312] cobalamin biosynthesis protein CbiD 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.136591 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAATCAAT TTACTCTTCC CGTTTGGGTG GTTGCTGCTG CAAAGTCAGC AACGAATATT 
CTTATTGGTA ATAAATTTAG GGATAAAGAG CGAATTGATT TACCAAATAA AGAAGAATCG
ATTTCGGTAC CTATTTCTTC TTCTGCTTTA CTCGATAACG GTAAAAGATC TTTAGCAGTA
AGTCATTGTC AGTCTGGATT GCCTCTTGAC ATAACAAGAG GAGTAGAAAT CTGGGCTTAT
ATTCAATTAA GTAAAGGAAG TTCTCAATCT AAAGGGAAAG TTCAAAATGG TTTTCCTGAT
TGGCTTGATT TTCATGCCGG TTGTGGAGTA GGTAAATTTC AATCATCTGG TCAGCCATGT
ATTTCTCAGT TTGCGCGTGA CTTGCTATGT ATTAATCTTT ACCCTCTTGT ACCCAAAGGT
AATTCAATTA AAGTTGAGAT TATTTTACCT GAAGGGAAAG ATCGTGCATC AAAGACAAGT
AATGAAGCCT TTGGAGTTGT AGATGGATTG TCCCTCATTG GGACCCAGGC TGAGGTTCAA
ATTAGTGCTT CTCCAGATCA GTTGAAAAAC TGCAAAGAGA TTTTGTACCA CAAATGCTCT
GAAGCAAAAT TTGATGGATG TTTGACTTTT GTGATTGGTG AAAATGGAAT GGATTTAGCG
ATGAAATATG GCCTGCCAGC TAATCAAATT ATTAAAACCG GGAATTGGCT AGGTCCTCTT
CTTGTTGCTG CTGCAGAAAA TGGAGTCAAG AAACTTTTAT TATTTGGATA TCATGGAAAA
CTTATAAAAC TTTCTGGCGG CGTTTTTCAT ACACATCATC ATCTTGCTGA TGGAAGGATT
GAAATACTCA CGTCACTTGC ATTCAGAGAA GGAATCTCAT TTGATTTGAT TGAGTTAATA
AGTAAATCAA CATCAGTGGA AAATGCTTTA TTAACCCTTG AAGTAAGTAA CCCAGATGCT
GTGTCTTTGA TATGGAGCAG GATGGCTAAA GAAATTGAAA TTAAAAGCAG AAGCTATGTG
AATAGATACT TGTCTTCATC AATGGAAATA GGATCTGTTT TATTTGATCG TAAGAGACAA
ATGCGTTGGG CTGGTCTTGA GGGTTTAAAA CAGATTAATT CTTTGGGGTT AATTCTTAAG
CGATAG
 
Protein sequence
MNQFTLPVWV VAAAKSATNI LIGNKFRDKE RIDLPNKEES ISVPISSSAL LDNGKRSLAV 
SHCQSGLPLD ITRGVEIWAY IQLSKGSSQS KGKVQNGFPD WLDFHAGCGV GKFQSSGQPC
ISQFARDLLC INLYPLVPKG NSIKVEIILP EGKDRASKTS NEAFGVVDGL SLIGTQAEVQ
ISASPDQLKN CKEILYHKCS EAKFDGCLTF VIGENGMDLA MKYGLPANQI IKTGNWLGPL
LVAAAENGVK KLLLFGYHGK LIKLSGGVFH THHHLADGRI EILTSLAFRE GISFDLIELI
SKSTSVENAL LTLEVSNPDA VSLIWSRMAK EIEIKSRSYV NRYLSSSMEI GSVLFDRKRQ
MRWAGLEGLK QINSLGLILK R