Gene A9601_08361 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagA9601_08361 
SymbolilvD 
ID4717541 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. AS9601 
KingdomBacteria 
Replicon accessionNC_008816 
Strand
Start bp724972 
End bp726645 
Gene Length1674 bp 
Protein Length557 aa 
Translation table11 
GC content37% 
IMG OID640078548 
Productdihydroxy-acid dehydratase 
Protein accessionYP_001009227 
Protein GI123968369 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism 
COG ID[COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase 
TIGRFAM ID[TIGR00110] dihydroxy-acid dehydratase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATAAAC TCAGATCATC TGCAATAACC CAAGGTGTGC AAAGATCACC TAACAGATCG 
ATGTTAAGAG CTGTTGGATT TAATGATGAA GATTTTAATA AACCTATTAT TGGAGTTGCA
AATGGATACA GCACCATAAC ACCATGCAAT ATGGGTTTAA ATAAGTTAGC TCTAAAAGCT
GAAGAGTCTA TAAAAAGATC AGGTGGGATG CCTCAGATGT TTGGGACTAT AACAGTAAGT
GATGGGATTT CTATGGGAAC AGAGGGCATG AAATATTCCC TAGTTTCAAG AGAAGTTATT
GCTGATTCAA TTGAAACAGC ATGCAATGCT CAGAGTATGG ATGGAGTACT TGCTATAGGT
GGATGTGATA AAAATATGCC GGGTGCCATG ATTGCGATTG CAAGAATGAA TATTCCATCA
ATTTTCATTT ATGGAGGAAC AATAAAGCCT GGGAAATTGC ATGGAGAAGA TCTTACTGTT
GTTAGTGCAT TTGAAGCTGT TGGACAATTA ACATCAGGCA AAATTAATGA AGAAAGGCTA
ATCCAAGTTG AGAAAAATTG TATTCCTGGT GCTGGTAGCT GTGGAGGAAT GTTTACAGCT
AATACAATGT CTGCGGTTAT TGAAGTATTA GGGTTAAGTC TTCCTCACAG TTCCACTATG
GCTGCTGAAG ATCTTGAAAA AGAACTAAGT GCAGACAAAA GTGCTGAGAT ATTAGTCTCC
GCAATAGAAA AAGATATAAG ACCTCTAGAC CTAATGACTA AGAAAGCATT TGAAAATGCA
ATATCAGTAA TTATGGCAAT TGGCGGATCA ACAAATGCGG TATTGCACAT CTTAGCTATC
GCGAATACTG CAGGAATAGA TATCAACATT AATGATTTTG AGAGAATCAG ACAAAAAGTA
CCCGTTATTT GTGACCTTAA ACCGAGTGGT AAATATGTGA CGGTGGATCT TCATAAGGCA
GGTGGGATTC CACAAGTAAT GAAAATACTT TTGAATGCAG GATTAATTCA TGGCGATTGC
AAAAACATTG AAGGAAAAAC CATCTCAGAA TACTTACAAA ATATTCCAGA TAAGCCTCCA
ACAAATCAAA ATGTCATAAG AGACATAGAT AACCCTCTTT ATAAAAAAGG ACATCTAGCG
ATATTAAAAG GTAACTTAGC GAGCGAAGGT TCTGTAGCCA AAATTAGCGG AGTAAAAAAC
CCTGTATTAA CAGGTCCCGC AAAAATTTTT GAAAGTGAAG AAGATTGTTT AAAATCGATA
TTAAATAACG ATATCAAAGC TGGTGATGTT GTTGTTATTA GAAACGAAGG TCCTGTAGGA
GGACCAGGTA TGAGAGAGAT GTTAGCTCCA ACATCTGCAA TTGTTGGTCA AGGGCTAGGA
GAGAAGGTGG CTTTAATTAC CGATGGCAGA TTTAGCGGCG GTACCTATGG TCTTGTTGTG
GGTCACATAG CTCCAGAGGC TGCTGTAGGA GGAAATATTG CTCTAATAAA ACAAGGTGAT
TTAATTACAG TAGATGCTGT AAAACAACTA ATTGAAGTTG ATTTATCTGA CGAAGAATTA
GAAAAAAGAA AAAAAGATTG GGTAAAACCT ATTCAAAAAT ACAAAAGAGG AATTCTTTCA
AAATATTCGA GAATCGTAAG CACATCAAGT TTAGGGGCTG TTACTGATTT ATAA
 
Protein sequence
MNKLRSSAIT QGVQRSPNRS MLRAVGFNDE DFNKPIIGVA NGYSTITPCN MGLNKLALKA 
EESIKRSGGM PQMFGTITVS DGISMGTEGM KYSLVSREVI ADSIETACNA QSMDGVLAIG
GCDKNMPGAM IAIARMNIPS IFIYGGTIKP GKLHGEDLTV VSAFEAVGQL TSGKINEERL
IQVEKNCIPG AGSCGGMFTA NTMSAVIEVL GLSLPHSSTM AAEDLEKELS ADKSAEILVS
AIEKDIRPLD LMTKKAFENA ISVIMAIGGS TNAVLHILAI ANTAGIDINI NDFERIRQKV
PVICDLKPSG KYVTVDLHKA GGIPQVMKIL LNAGLIHGDC KNIEGKTISE YLQNIPDKPP
TNQNVIRDID NPLYKKGHLA ILKGNLASEG SVAKISGVKN PVLTGPAKIF ESEEDCLKSI
LNNDIKAGDV VVIRNEGPVG GPGMREMLAP TSAIVGQGLG EKVALITDGR FSGGTYGLVV
GHIAPEAAVG GNIALIKQGD LITVDAVKQL IEVDLSDEEL EKRKKDWVKP IQKYKRGILS
KYSRIVSTSS LGAVTDL