Gene NATL1_08121 details


Gene Information

Locus tag: NATL1_08121
Symbol: ilvD
ID: 4781279
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. NATL1A
Kingdom: Bacteria
Replicon accession: NC_008819
Strand:
Start bp: 748362
End bp: 750032
Gene Length: 1671 bp
Protein Length: 556 aa
Translation table: 11
GC content: 42%
IMG OID: 640084087
Product: dihydroxy-acid dehydratase
Protein accession: YP_001014635
Protein GI: 124025519
COG category: [E] Amino acid transport and metabolism; [G] Carbohydrate transport and metabolism
COG ID: [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase
TIGRFAM ID: [TIGR00110] dihydroxy-acid dehydratase
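
The start, end, gene length and protein length fields can be cross-checked against each other. The sketch below assumes that the coordinates are 1-based and inclusive and that the CDS length includes a single terminal stop codon; neither assumption is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 748362
END_BP = 750032
GENE_LENGTH_BP = 1671
PROTEIN_LENGTH_AA = 556


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residue count for a CDS, assuming it ends in exactly one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both comparisons hold for this record: the coordinate span is 1671 bp, and 1671 / 3 - 1 gives 556 residues.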


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGCTTAGAT CAAATGCAAT AACACAGGGT ATTCAACGAT CACCAAACCG AGCAATGCTT 
AGAGCTGTTG GGTTTGATGA TAATGATTTT AATAAGCCAA TCATTGGAAT CGCTAATGGG
CACAGCACAA TAACCCCATG CAATATGGGA TTAATGGATT TAGCTAATAG AGCAGAGTCA
GCTTTAAAAG AGGCTGGTGC AATGCCTCAA ACTTTCGGAA CCATAACTGT TAGTGACGGC
ATTTCCATGG GAACAGAAGG GATGAAATAC TCATTAGTTT CAAGAGAAGT TATTGCAGAT
TCAATCGAGA CGGCTTGCAA TGCTCAAAGC ATGGATGGGG TGTTAGCTAT AGGTGGATGT
GACAAAAATA TGCCAGGAGC AATGTTATCA ATGGCAAGAA TGAACATTCC TTCGATTTTT
GTTTATGGGG GGACAATTAA ACCTGGGAAA TTAGACGGTT GCGACTTGAC CGTGGTGAGT
TCATTTGAGG CTGTCGGTCA ATTAGCAAGC GGAAAAATCG ACAAAGATCG TTTAATAGCT
GTAGAAAAAA ATGCTATCCC TGGACCAGGA AGTTGCGGAG GGATGTTTAC TGCTAACACC
ATGTCTGCGG CAATCGAAAC GATGGGGTTT AGTTTGCCAT TTAGTTCAAC AATGGCTGCA
GTAGATGATG AAAAAGCAGA AAGCGCTGCT GAAAGTGCTC AGGTCCTTGT CAATGCAGTA
AAGAACAACA TCAGACCATT AGATTTACTT ACAAAAGAAG CCTTTGAGAA TGCAATCAGT
GTCATCATGG CTGTAGGCGG CTCAACAAAT TCTGTCCTTC ACCTACTTGC TATAGCTAGG
ACTGCAGGAG TTGATTTAAC AATAGATGAC TTTGAACGTA TAAGGCAGAC TGTTCCTGTG
ATTTGCGACC TCAAACCAAG CGGTAAATAC GTAACAGTCG ACCTACATAA AGCTGGTGGG
ATTCCTCAAG TAATGAAAAT ACTCCTCGAT GCAGGAATGC TCCATGGAGA ATGCAAAACC
ATTGAAGGAA AAACAATTAA GGAAGTCCTT AGAGATATCC CCTCAAAGCC CAAAGAAAAT
CAAGACGTGA TAAGACAAAT ATCAAACCCT ATTTATAAAA AAGGACATCT AGCTATTCTC
AAAGGAAATT TAGCAAGTGA AGGAAGTGTC GCAAAAATTA GTGGTGTGAA GACTCCCGTT
TTAACCGGGC CTGCAAGGGT CTTTGAGAGT GAAGAAGAAT GCTTAACTGC CATTCTTGAC
AATAAAGTTA AAGCTGGAGA TGTAGTCGTC GTTAGATATG AAGGACCTGT AGGAGGGCCT
GGCATGAGAG AGATGCTCTC TCCAACCTCA GCAATTGTAG GTCAAGGATT AGGAGAGAAA
GTTGCACTAA TTACTGATGG TCGATTTAGT GGCGGATCAT ATGGATTAGT GGTTGGACAC
GTTGCTCCAG AAGCAGCCGT CGGAGGAACA ATTGGATTAG TAGAGGAAGG AGACAGTATT
ACCGTTGACG CTAATAAATT GTTAATTCAA TTAAATGTCG AAGAGCGAGA ATTAGCTAGA
AGGAAAGAAA AATGGGAGAA GCCAAAGCCT AGATATAAGA CAGGCATTCT TGGGAAGTAT
TCCAGATTAG TAAGTTCATC AAGCCAAGGA GCGACAACTG ATCAAATATA A
 
Protein sequence
MLRSNAITQG IQRSPNRAML RAVGFDDNDF NKPIIGIANG HSTITPCNMG LMDLANRAES 
ALKEAGAMPQ TFGTITVSDG ISMGTEGMKY SLVSREVIAD SIETACNAQS MDGVLAIGGC
DKNMPGAMLS MARMNIPSIF VYGGTIKPGK LDGCDLTVVS SFEAVGQLAS GKIDKDRLIA
VEKNAIPGPG SCGGMFTANT MSAAIETMGF SLPFSSTMAA VDDEKAESAA ESAQVLVNAV
KNNIRPLDLL TKEAFENAIS VIMAVGGSTN SVLHLLAIAR TAGVDLTIDD FERIRQTVPV
ICDLKPSGKY VTVDLHKAGG IPQVMKILLD AGMLHGECKT IEGKTIKEVL RDIPSKPKEN
QDVIRQISNP IYKKGHLAIL KGNLASEGSV AKISGVKTPV LTGPARVFES EEECLTAILD
NKVKAGDVVV VRYEGPVGGP GMREMLSPTS AIVGQGLGEK VALITDGRFS GGSYGLVVGH
VAPEAAVGGT IGLVEEGDSI TVDANKLLIQ LNVEERELAR RKEKWEKPKP RYKTGILGKY
SRLVSSSSQG ATTDQI
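
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a minimal sketch, not the database's own method: `clean` and `translate` are hypothetical helper names, the codon table is built from the standard TCAG ordering, and it relies on translation table 11 assigning the same amino acids to sense codons as the standard code (the two differ in their permitted start codons, which this sketch does not model).

```python
from itertools import product

# Codon table in the conventional TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three 10-base blocks of the gene sequence in this record.
    print(translate("ATGCTTAGAT CAAATGCAAT AACACAGGGT"))
```

Applied to the first three blocks of the gene sequence, this gives MLRSNAITQG, which matches the first block of the protein sequence. Pasting the full gene sequence into `translate` should allow the same comparison against all 556 residues.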