Gene NATL1_10041 details


Gene Information

Locus tag: NATL1_10041
Symbol: cobN
ID: 4780720
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. NATL1A
Kingdom: Bacteria
Replicon accession: NC_008819
Strand:
Start bp: 924796
End bp: 928584
Gene Length: 3789 bp
Protein Length: 1262 aa
Translation table: 11
GC content: 31%
IMG OID: 640084282
Product: cobalamin biosynthetic protein CobN
Protein accession: YP_001014827
Protein GI: 124025711
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG1429] Cobalamin biosynthesis protein CobN and related Mg-chelatases
TIGRFAM ID: [TIGR02257] cobaltochelatase, CobN subunit
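
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 924796
end_bp = 928584

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the final codon is assumed to be the stop codon.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1

print(gene_length)     # 3789
print(remainder)       # 0
print(protein_length)  # 1262
```

Under these assumptions the computed values match the reported Gene Length (3789 bp) and Protein Length (1262 aa).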


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.460676
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCATAGAA TTGCCAGCCT TCCTGGGAAT GATCCTCAAG AAGAAATTAC ATTTATTGAA 
CAACCAAATG CACCAGTTAT CTTCTTAACT AGTGCTACAT CCGACATAAC ATGCTTATCC
TCTGTATTAA AGCAACCTAA AAATAAAAAG TGGAGAAATA AAATCAGGGC TTTACCAATA
GCCTATTTAT CATCAAATGC ATCAATTGAC CATTATATAT CCAATACATG TAATACAGCG
GAAATCGTTG TTGTTAGGTT TTTAGGATCA AGATCATATT GGTCATATGG ATTTGAACAA
TTAAGTTTAT GGCAATTAGA GAAACCTAAT AGACAATTAA TAGTAGTCTC TGGCATAGAG
AGTACAGCAA ATGATCTTAA AGATATTAGT AGTATAAAAA AAGTTCAAGT TGATTTTATT
CAACTTCTAT TAAATGAAGG CGGTTTAAAA AACTATACTT ACCTATTAAA GGTTTTAGAT
AACTTAAAAA GTTTAGAGGA AATTAAAGTA GAAAGAGATT TAATTGAATA TCATGATGAT
TTAGTCAAAT GGAAATGGAT AGATAATGAT TACCCTAAGA TTGCTATATT TCTCTACAAA
TCATTGTTGC AATCAGGCAA TACAGAATTA GCAGACTCAA TAGTTAAAAT TTCAAAAAGA
CATCAGATAA ATGCCAAAAT CGTATGGATT ACTAGTTTTA AGTCAAAAGA TATTCAGAAA
GAGATAATAA ATTTATTGGA TAATGAAAAA ATTGAAGCGA TCATTACTAC TACTTCATTT
TCATCAGTTG AATATAAGAA TGATAATATA AAAAATAGCA TATGGGACAC ATTAGACGTT
CCTGTCTATC AACTACTTAT ATCCAGTTCG ACGGTTAATG AATGGAAAAA ATCAACAGTA
GGTTTAAATC CAATTGACTT ATCAATTCAA GTTGTTCTGC CAGAAGTAGA TGGTCGAATT
TGTACAGTTC CAATTGCATT TAAAAATCTT AGCTATACAG ATAATGAATT ATCTATATCT
GTATATAAGA CAGAACCTTA TGACAAACAT ATTGAATGGT GTATTAATTA CATAAAAAAT
CTTATTTATC TTAGAAAAAC AAGTAATTTC AATAAAAAAA TTGCTGTTGT AATAGCTAAT
TATCCTGTTA AGGATTCAAG AATTGCAAAT GGAGTTGGTT TAGATACACC AGAAAGCCTT
TTGTACGTGT TGAGATGGTT AAAAGATGAA GGATATTATT TAGGAGATAA CCCCTTGCCA
CGAAGTAGTA AAGAGCTAAT TAATAAGTTA ATTACTTATA GGACAAATTC TGAAGAAACC
ATATCTAACG AACCCCAATC ATATTTAAGC CTTTTTGATT ATTTATACCA TTGGAATAAA
ATTGAATCTA ATGCGAAGGA AAAGTTAATT AAACGATGGG GAGAACCAAA TTTTTCAAAA
CAATTAGAAG AAAGCGGGTT CCCGATCAAT GGATTCACTC TAGGAAATAT AACTGTATTA
GTTCAAGCAA ATAGAGGCTT TGAATCAGAT AATTTATCAG ATTTGCATTC ACCAGATCTA
CCACCTACAC ATAAATATAT TGCGCAATAT TTTTGGTTAA CGCATTCATT TAAGGCAAAT
GCAATTATAC ATCTAGGAAA GCACGGTAGC GTTGAATGGT TACCTGGTAA AGGTGTTGGT
TTAAGTAGTA ATTGTTTCCC ATTTATTATT TGCCCACCAG TACCAAACAT ATACCCATTT
ATAGTAAATG ACCCTGGAGA AGGCTCACAA GCAAAAAGAA GGACACATGC GATAATTATT
GATCATTTAA CACCACCACT TTCTAACGCT GGTAGTTATA ATGAGTTATT AGAAATTGAG
AACTTAATTG ATGAATATTA TGAATCTAAA CTTATTAATG ATGATAGAAA TCAGTTATTA
GAAAAAAAGA TTATAGAGTT ATTAATTAAC AATTCATGGC CAAGTATTGA CCCAAATATA
ATTAAAGATG ATATAAAAAA ATGTAATATA CAAGAGATAA TAGATTCTGC AGAAAGTTAT
TTATGTGAAT TAAAACAATC CCAAATAAGG ACAGGACTTC ATATCTTGGG TGTGAATCTA
AGTATGGATA AACTAATTGA ATTAACGTTG GCTATATCAA ATGTACCTAC AGGGAACATC
TCTGGTTTAT CACAATGTTT AGCTGAAGAT TTAGGTTTCA CCATAGACCC CTGGAGAGAT
GAAGAAAATT TGGACTTAAC GCAATCAGAT ATTAATTTAT TTAAAGAATA TACCTCTATT
AATGCAAGAA AAGTAGGGCA GCTAATTGAC TGGTTTAATG AAATTGGAAT GTATATAATT
GAGTTTCATT GTATACACAT ACTTAATCTC GAAACCAGTG CCAAAAATAA AATTGAACTT
GATAGTAAAC TACTAAAATA TTTAGATATT GAAAACCCAA ATATATTTAT AAATCATTTA
ATAACTGATA TTTTACCTAG ATTATTAAAA AGTTCCAAAA ACGAAAAGAC TAATTTACTT
AAAGCGTTGG AAGGGAAGAG AATTGCAAGC GGCCCTTCTG GTGCTCCAAC AAGAGGAAAA
TTAGAGGTCT TACCGACAGG TAAAAATTTC TTTTCTGTTG ATATTAGAGC TATTCCAACA
GAAACAGCAT GGGACCTAGG GAAAAGAAGT GCTGAAAAAT TAGTTGATCT TTACCTACAA
GAAAATGGAG AGCATCTATT ACATCTCGCT ATTTCAATAT GGGGAACTTC AACCATGAGA
AATGGAGGAG AAGATATTTG TCAACTATTT GCACTTATGG GTCTAATGCC TATATGGGAT
GGAACTTTAA GAAGAGTTGT TGACGTTGAA GTCATTCCCA TGAGTGTTTT AAATAGGCCC
AGAGTCGACG TCACATTAAG AATATCTGGT TTATTTAGGG ATGCATTCCC CCAAATCATT
GAATTAATAC GAAGAGGACA AAATCTTATT GGGAATTTAA ATGAGCCCAA AGCAATTAAT
CCCCTTGCAG AGTCTTATAG AAATGGAAAC ACCGACTCAA GAATATATGG ATCAGCACCT
GGATCATATG GAGCAGGTTT GCAAGAAATT ATCAATAATG GCTCATGGGA ACATCAATCT
GAACTGGCTA ATGCATTTAT AGAATGGAGT AAATGGCGAT ATGAAGGTTC AAACAATATT
GTTAAAGACA AAAAAGGGTT GGAATATAAT TTATCAAAAG TAAAAGTAGT GCTACATAGT
CAAGATAATA GAGAGCACGA CATACTCGAT TCAGATGATT ACTATCAATT TCAGGGTGGC
TTAATCTCTG CAGTAAAGAA AACTAGTGGT AACAATCCAC AAGCTTATTT TGCAGATAAT
TCTAGATATC AACGACCAAG AATACATAAA CTTTCAAAAG AAATAGATAA GGTTGTTAGG
AGTAGGTTAT TAAATCCAAA ATGGCTTGAG GGAATGAAAC AACATGGCTA TAAAGGAGCT
TTCGAGATGT CAGCAAGCCT TGACTATTTA TTCTCTTTTG ATGCAACAAC TAACCTTGTT
CCTAATTGGT GTTATGAGTC AATTGCAAGT AATTGGTTGG AAGATGAAAA GACAAGAAAA
TTTATTATTG ATAACAATCC ATGGGCATTA AGAGATATCG CTGAAAGACT ACTAGAAGCA
TCAAATCGAA AGCTTTGGAC AAATGCAACT AATGAAGAAA TAGCATCTAT TAAATCTGTT
TTATCTGATA TAGACAGTAA AATTGAAAAA TTTAATTCCA GTAGAAGAAA TAATAAATCA
GCGTCTTAA
 
Protein sequence
MHRIASLPGN DPQEEITFIE QPNAPVIFLT SATSDITCLS SVLKQPKNKK WRNKIRALPI 
AYLSSNASID HYISNTCNTA EIVVVRFLGS RSYWSYGFEQ LSLWQLEKPN RQLIVVSGIE
STANDLKDIS SIKKVQVDFI QLLLNEGGLK NYTYLLKVLD NLKSLEEIKV ERDLIEYHDD
LVKWKWIDND YPKIAIFLYK SLLQSGNTEL ADSIVKISKR HQINAKIVWI TSFKSKDIQK
EIINLLDNEK IEAIITTTSF SSVEYKNDNI KNSIWDTLDV PVYQLLISSS TVNEWKKSTV
GLNPIDLSIQ VVLPEVDGRI CTVPIAFKNL SYTDNELSIS VYKTEPYDKH IEWCINYIKN
LIYLRKTSNF NKKIAVVIAN YPVKDSRIAN GVGLDTPESL LYVLRWLKDE GYYLGDNPLP
RSSKELINKL ITYRTNSEET ISNEPQSYLS LFDYLYHWNK IESNAKEKLI KRWGEPNFSK
QLEESGFPIN GFTLGNITVL VQANRGFESD NLSDLHSPDL PPTHKYIAQY FWLTHSFKAN
AIIHLGKHGS VEWLPGKGVG LSSNCFPFII CPPVPNIYPF IVNDPGEGSQ AKRRTHAIII
DHLTPPLSNA GSYNELLEIE NLIDEYYESK LINDDRNQLL EKKIIELLIN NSWPSIDPNI
IKDDIKKCNI QEIIDSAESY LCELKQSQIR TGLHILGVNL SMDKLIELTL AISNVPTGNI
SGLSQCLAED LGFTIDPWRD EENLDLTQSD INLFKEYTSI NARKVGQLID WFNEIGMYII
EFHCIHILNL ETSAKNKIEL DSKLLKYLDI ENPNIFINHL ITDILPRLLK SSKNEKTNLL
KALEGKRIAS GPSGAPTRGK LEVLPTGKNF FSVDIRAIPT ETAWDLGKRS AEKLVDLYLQ
ENGEHLLHLA ISIWGTSTMR NGGEDICQLF ALMGLMPIWD GTLRRVVDVE VIPMSVLNRP
RVDVTLRISG LFRDAFPQII ELIRRGQNLI GNLNEPKAIN PLAESYRNGN TDSRIYGSAP
GSYGAGLQEI INNGSWEHQS ELANAFIEWS KWRYEGSNNI VKDKKGLEYN LSKVKVVLHS
QDNREHDILD SDDYYQFQGG LISAVKKTSG NNPQAYFADN SRYQRPRIHK LSKEIDKVVR
SRLLNPKWLE GMKQHGYKGA FEMSASLDYL FSFDATTNLV PNWCYESIAS NWLEDEKTRK
FIIDNNPWAL RDIAERLLEA SNRKLWTNAT NEEIASIKSV LSDIDSKIEK FNSSRRNNKS
AS
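
The gene and protein sequences above are printed in space-separated blocks of ten characters. As a minimal sketch of how the two listings relate, the Python below joins the blocks and translates codons using the standard amino-acid assignments, which translation table 11 shares with the standard code (the two differ in permitted start codons, not in codon meanings). The helper names (`join_blocks`, `translate`, `gc_fraction`) are hypothetical, and only the first and last rows of the gene sequence are used to keep the example short.

```python
from itertools import product

# Standard codon assignments in TCAG order (first base varies slowest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def join_blocks(text: str) -> str:
    """Remove the spaces and line breaks used for display."""
    return "".join(text.split()).upper()


def translate(dna: str) -> str:
    """Translate complete codons from position 0, stopping at a stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last rows of the gene sequence, copied from the listing above.
first_row = "ATGCATAGAA TTGCCAGCCT TCCTGGGAAT GATCCTCAAG AAGAAATTAC ATTTATTGAA"
last_row = "GCGTCTTAA"

print(translate(join_blocks(first_row)))  # MHRIASLPGNDPQEEITFIE
print(translate(join_blocks(last_row)))   # AS
```

The first row translates to the first twenty residues of the protein listing, and the last row gives the final two residues followed by the TAA stop codon. Applying `gc_fraction` to the full joined gene sequence would be the way to compare against the reported GC content of 31%; the value for a single row is not expected to match.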