Gene NATL1_03681 details

Gene Information

Locus tag: NATL1_03681
Symbol: chlD
ID: 4781227
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. NATL1A
Kingdom: Bacteria
Replicon accession: NC_008819
Strand:
Start bp: 340429
End bp: 342582
Gene Length: 2154 bp
Protein Length: 717 aa
Translation table: 11
GC content: 43%
IMG OID: 640083636
Product: protoporphyrin IX magnesium chelatase subunit ChlD
Protein accession: YP_001014197
Protein GI: 124025081
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG1239] Mg-chelatase subunit ChlI
TIGRFAM ID: [TIGR02031] magnesium chelatase ATPase subunit D
            [TIGR02442] cobaltochelatase subunit
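
The start and end coordinates, the gene length, and the protein length listed above agree with one another, and the relationship can be checked with simple arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates and an unspliced CDS whose final codon is a stop codon. The function names are hypothetical and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(340429, 342582)
print(length_bp)                  # 2154
print(protein_length(length_bp))  # 717
```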


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTTTCCA GTGGATTGAA CAATAACAAT ATGTCTTCGA CTTCAAAAGA CATTGCTGTA 
AAGGATAGAG CCAATCTTGC TTTCCCTTTG GCTGCAATTA CAGGCCATGG AACTTTAAAG
CTTGCTTTAA TGCTCGCAGC AGTTGATCCA GGTTTGGGAG GAGTAATAAT TGCGGGCGGT
CGCGGGACTG GGAAATCTGT TTTAGCTAGA GGCTTGCATG CCCTTCTCCC GCCGATTGAA
ATAATCGATT TAGACGAATT AGCCAATACT GAGGATGGAG ATGATTCATC TATTTATCCG
GCAGGTAGAA ATTTAGATCC TTCTTTTTCA GGAGAATGGG ATGATTTAAC TAAAAAACTT
TTTACAAAAA AGATAGGGAG TTATGAAAAT ACTGATGACT TAGAAAATAT CCCTACAAAA
GTTGTTTCTG CGCCATTTAT TCAGGTTCCC TTAGGAGTAA CTGAGGATAG GCTTGTTGGT
GCAGTTGATG TCGCTGCTTC ATTATCAAGC GGAACTCCAG TATTTCAGCC AGGCTTACTA
GCTGAGGCTC ACAGAGGAGT TTTGTACATA GACGAATTGA ATTTATTGGA TGACAACATT
GTCAATCTTC TTTTGGCTTC AGTAGGTGCA GGTGAAAATA GAGTTGAACG AGAAGGATTG
AGTCTAAGTC ACCCATGCCG TCCTCTTTTG ATTGCTACTT ACAATCCTGA AGAGGGGGGA
TTAAGGGACC ACCTCTTGGA CAGATTTGCA ATTGTTTTAT CTGCAGATCA ATTAATTACA
AACGAACAAA GAGTTGAAAT AACTCAATCT GCCATTTCTC ATGGTCAATC AAGTGAAGCT
TTTTCTAAGA AATGGTCTGA AGACACAGAA TCTCTATCAA CACAATTGCT TTTGGCAAGG
CAATGGCTTC CAGATGTTCA AATCAGTGAG GATCAAATTG AATACTTAGT TGTGGAAGCC
ATACGAGGAG GTGTAGAAGG ACACAGATCA GAGCTTTATG CAGTAAGGGT TGCAAAGGCT
CATGCTGCCT TGTGTGGAAG GGATTCAGTG GACGCAGAAG ATTTAAAAGC GGCAGTAAGA
CTTGTCATTG CTCCCAGGGC TATGCAAATG CCGTCAGAAG AGGAGATGGA GCCTCCGGCC
CCTGAAGATC AGCAGCCACC ACCACCGCCT CCTGAAGACT CTGATGACAA TAATGATCAA
GAGGAAGATC AAGAAGAAGA TCAGGAAGAG GAGCAAGATG AAGAGTCATC CCCTCCAATC
CCAGAGGAAT TTATGCTTGA TCCAGAAGCA TGTGCCGTAG ATCCTGATTT GTTGTTGTTT
TCATCCACTA AATCAAAGAG TGGAAATAGC GGAAGTCGAT CAGCTGTCTT AAGCGATAAC
AGAGGTCGAT ACGTAAAACC AATTATTCCT AGAGGACCTG TTAGAAGGAT TGCCGTTGAT
GCAACCTTAA GAGCTGCAGC TCCTTATCAG AAAGCGAGAA GGGAAAGAGA ACCTAATAGG
AAAGTTATTG TTGAGGAAGG CGATTTACGC GCAAAATTAC TGCAGCGTAA AGCTGGTGCA
TTGGTAATTT TCTTGGTTGA TGCAAGTGGA TCAATGGCGC TCAATAGAAT GCAAAGTGCA
AAAGGAGCTG TTATTCGGTT GCTAACCGAA GCTTATGAAA ACAGAGATGA AGTTTCTCTT
ATACCTTTTA GAGGGGATCA AGCAGAAGTT TTACTCCCTC CAACTAGATC AATCACCGCT
GCGAAAAGGA GACTTGAGGC AATGCCATGT GGTGGTGGCT CTCCACTAGC TCATGGGTTA
ACGCAGGCTG CAAGAGTTGG AGCAAATGCC TTAGCCACAG GAGATCTCGG ACAGGTTGTA
GTAGTGGCAA TTACTGATGG GCGTGGAAAC GTTCCATTAA GCACTTCACT AGGTCAACCA
ATTCTTGAGG GAGAGACACC ACCCGACTTG AAGCAAGAAG TTCTTGATGT TGCTTCGCGT
TATCGTGCGC TTGGTATTAA GTTGCTTGTA ATTGATACAG AAAGAAAGTT TATTGCTAGT
GGAATGGGTA AAGACTTAGC TGAAGCATCC GGAGGTAAAT ATGTTCAATT ACCTAAGGCA
AGTGATAAGG CTATTGCTTC AATTGCAATG GATGCGATTA ATAGCGTTAC CTGA
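
The gene sequence above is printed in space-separated blocks of 10 bases. The following is a minimal sketch of how such a block-formatted string could be joined into one contiguous sequence and its GC fraction computed. The function names are hypothetical, and the example input is only the first line of the sequence, so its result is not the gene-wide 43% figure.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, copied as displayed.
first_line = (
    "ATGGTTTCCA GTGGATTGAA CAATAACAAT ATGTCTTCGA CTTCAAAAGA CATTGCTGTA"
)
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Passing the full gene sequence through the same two functions would give a value that can be compared with the GC content reported in the gene information.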
 
Protein sequence
MVSSGLNNNN MSSTSKDIAV KDRANLAFPL AAITGHGTLK LALMLAAVDP GLGGVIIAGG 
RGTGKSVLAR GLHALLPPIE IIDLDELANT EDGDDSSIYP AGRNLDPSFS GEWDDLTKKL
FTKKIGSYEN TDDLENIPTK VVSAPFIQVP LGVTEDRLVG AVDVAASLSS GTPVFQPGLL
AEAHRGVLYI DELNLLDDNI VNLLLASVGA GENRVEREGL SLSHPCRPLL IATYNPEEGG
LRDHLLDRFA IVLSADQLIT NEQRVEITQS AISHGQSSEA FSKKWSEDTE SLSTQLLLAR
QWLPDVQISE DQIEYLVVEA IRGGVEGHRS ELYAVRVAKA HAALCGRDSV DAEDLKAAVR
LVIAPRAMQM PSEEEMEPPA PEDQQPPPPP PEDSDDNNDQ EEDQEEDQEE EQDEESSPPI
PEEFMLDPEA CAVDPDLLLF SSTKSKSGNS GSRSAVLSDN RGRYVKPIIP RGPVRRIAVD
ATLRAAAPYQ KARREREPNR KVIVEEGDLR AKLLQRKAGA LVIFLVDASG SMALNRMQSA
KGAVIRLLTE AYENRDEVSL IPFRGDQAEV LLPPTRSITA AKRRLEAMPC GGGSPLAHGL
TQAARVGANA LATGDLGQVV VVAITDGRGN VPLSTSLGQP ILEGETPPDL KQEVLDVASR
YRALGIKLLV IDTERKFIAS GMGKDLAEAS GGKYVQLPKA SDKAIASIAM DAINSVT