Gene Daud_0540 details


Gene Information

Locus tag: Daud_0540
Symbol:
ID: 6027661
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Desulforudis audaxviator MP104C
Kingdom: Bacteria
Replicon accession: NC_010424
Strand:
Start bp: 578324
End bp: 580033
Gene Length: 1710 bp
Protein Length: 569 aa
Translation table: 11
GC content: 62%
IMG OID: 641593376
Product: dihydroxy-acid dehydratase
Protein accession: YP_001716713
Protein GI: 169830731
COG category: [E] Amino acid transport and metabolism
              [G] Carbohydrate transport and metabolism
COG ID: [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase
TIGRFAM ID: [TIGR00110] dihydroxy-acid dehydratase
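
The coordinate, gene length, and protein length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, though it agrees with the listed gene length) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 578324
end_bp = 580033

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is assumed to be a stop codon.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, "bp")      # matches the listed Gene Length
print(protein_length, "aa")   # matches the listed Protein Length
```

With the values listed, this gives 1710 bp and 569 aa, matching the Gene Length and Protein Length fields.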


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACAGCG AAGTCATCAA AAAGGGCGTG GAGCGCGCCC CGCACCGCAG CCTGCTGCGG 
GCCACCGGCA TCATTAAAGA CGAAGCGGAT TTCCAGAAAC CTTTTGTGGC CATTGCCAAT
TCCTTCGCCG AAATCGTGCC GGGACATGCC CATCTGAACG AATTTGTCAA GGAAATTAAA
GAAGCGGTGC GGGAAGCGGG GGGCGTTCCT TTTGAGTTCA ACACCCTGGC CTTGTGCGAC
GGCATTGCCA TGAATCACCC CGGCATGCGC TACAGCCTGC CTTCCCGGGA ACTGGTCGCC
GACAGTGTGG AAACAATGGT GCAAGGCCAC CGGTTTGACG GCCTCATCTG TATCCCCAAT
TGCGACAAAA TCGTGCCCGG CATGTTGATG GCGGCCGTGC GGCTCAATAT CCCCACCATC
TTTGTCTCCG GCGGCCCGAT GAAAGCCGGG CGGGCCAAGG ACGGCCGCCC GATAGACCTG
ATTTCTGTCT TTGAAGGGAT CGGGGCTTTC CGCGCCGGCA AGATCGATGC AGGCGACCTG
TTGGAGCTGG AGCAGGCCGC CTGTCCCGGT TACGGCAGCT GCGCCGGCAT GTTTACCGCC
AACTCCATGA ACTGCCTCTG TGAGGCCTTG GGCCTGGCTT TGCCCGGCAA CGGCACCATT
TTAGCCGTCG ACCCGAGAAG AAGTGAGCTG AAAAAGTGGG CCGGGCGCCA AATTGTAGAG
CTGATTAAAC GGGATCTCCG GCCACGGGAC ATCGTCACCC CGGAGGCGAT TGATAACGCC
TTTGCCCTGG ACGTAGCCAT GGGCGGCTCT ACCAACACCA TATTGCACCT TCTGGCTGTC
GCCCAGGAGG CAGGAATAAA CTACCCCCTT AAACGCGTCA ACCTGATATC CGCCCGTACG
CCCACCCTCT GCAAGATTTC TCCGGCCTCA AGCTTGCACA TCGAAGACGT GGACCGCGCC
GGCGGCGTCA GTGCCGTATT GGGCGAGCTA TCCCGCAAGC CCGGCCTGCT CAACCTGGAC
TGCCTGACGG TTACCGGAGA AACTTTGGGG GAAACCGTAG GCCAGGTCCA AAGCCTTGAC
CCGCGAGTCA TTCGCGGCGT AGAGGAGCCG TTGAGCCCCG TGGGGGGGTT GAAGGTCCTT
TTCGGCAGCC TGGCGCCGGA GGGGGCAGTG GTGAAGACGG CGGCGGTGGT GCCGCAGATG
ATGCGCCATC AGGGGCCGGC GGTAGTCTTC AATTCGGAGG CGGAGGCTTC TGCCGCCATC
CTGGGTGGGC GCATCAAGCA TGGCGATGTT GTAGTCATCC GCTTTGAGGG GCCGAAAGGC
GGTCCCGGCT TTATGGAGAT GCTGGGCCCC ACGGCGGCGC TGGTGGGGAT GGGGTTAGGT
GAGTCGGTAG CGCTAGTTAC CGACGGCCGC TTCTCCGGCG GCACGCGGGG TGCCTGCATC
GGCCACGTCT GCCCCGAAGC CGCCAGCGGC GGCCCCATTG CCCTGATCAA GGACGGCGAT
CTGATCTCTT ATGACCTGGA AGCGGGCACG CTGGAGCTGC TGGTGCCGCA GGAAGAGCTG
GCCGCCAGAA AGGCGGCCTT TACGCCGCCG CTCAGGCAGG GACTCACCGG CTGGCTGGCC
AGGTACGTCC AGATGGTTGC TCCGGCCAGT ATCGGCGCCG TGCTGCGTCC TGCATGCGGC
AGGCCGCCTG GGGAACAAGA CTACGAATAA
 
Protein sequence
MNSEVIKKGV ERAPHRSLLR ATGIIKDEAD FQKPFVAIAN SFAEIVPGHA HLNEFVKEIK 
EAVREAGGVP FEFNTLALCD GIAMNHPGMR YSLPSRELVA DSVETMVQGH RFDGLICIPN
CDKIVPGMLM AAVRLNIPTI FVSGGPMKAG RAKDGRPIDL ISVFEGIGAF RAGKIDAGDL
LELEQAACPG YGSCAGMFTA NSMNCLCEAL GLALPGNGTI LAVDPRRSEL KKWAGRQIVE
LIKRDLRPRD IVTPEAIDNA FALDVAMGGS TNTILHLLAV AQEAGINYPL KRVNLISART
PTLCKISPAS SLHIEDVDRA GGVSAVLGEL SRKPGLLNLD CLTVTGETLG ETVGQVQSLD
PRVIRGVEEP LSPVGGLKVL FGSLAPEGAV VKTAAVVPQM MRHQGPAVVF NSEAEASAAI
LGGRIKHGDV VVIRFEGPKG GPGFMEMLGP TAALVGMGLG ESVALVTDGR FSGGTRGACI
GHVCPEAASG GPIALIKDGD LISYDLEAGT LELLVPQEEL AARKAAFTPP LRQGLTGWLA
RYVQMVAPAS IGAVLRPACG RPPGEQDYE
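
The gene and protein sequences above are related by codon translation. The sketch below is one way to reproduce that relationship without external libraries. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in which codons may act as start codons, and that is not modelled here). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical, chosen for illustration.

```python
from itertools import product

# Codon assignments in the conventional TCAG ordering (third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last rows of the gene sequence listed above.
first_row = "ATGAACAGCG AAGTCATCAA AAAGGGCGTG GAGCGCGCCC CGCACCGCAG CCTGCTGCGG"
last_row = "AGGCCGCCTG GGGAACAAGA CTACGAATAA"

print(translate(first_row))  # MNSEVIKKGVERAPHRSLLR
print(translate(last_row))   # RPPGEQDYE
```

The two printed fragments correspond to the first 20 and last 9 residues of the protein sequence. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the listed GC content.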