Gene Daud_1904 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaud_1904 
Symbol 
ID6026477 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Desulforudis audaxviator MP104C 
KingdomBacteria 
Replicon accessionNC_010424 
Strand
Start bp2007071 
End bp2008189 
Gene Length1119 bp 
Protein Length372 aa 
Translation table11 
GC content67% 
IMG OID641594722 
Productprephenate dehydratase 
Protein accessionYP_001718029 
Protein GI169832047 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0077] Prephenate dehydratase
[COG1605] Chorismate mutase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0916421 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAACCA TAGCCTATCT CGGGCCGGAG GGAACTCATT CGGAAGAAGC GGCTTCCCGG 
TGGGCGGGTG ACCGCCCGAT GCTTCTCCGC CCATTGCGTT CCCTGGTCGA AGTGGTCGGG
GCGGTGGAGG GCGGGAGCGT GGACTGGGGT CTCCTGCCGG CGGAAAACTC GGGCGAAGGT
TCCCTGGGAC TGACACTGGA CCTTTTGGCC CACCAGGCCG ACCGGGTCCA GATCTGCGGG
GAGGTGGTGC TCCGTATCCG GCACCACCTG CTGGCCCGCC CGGGGGTGAG CCGGGAGCGG
GTCACCCGGA TCATTTCTCA TTCCCAGGCG CTGGCACAGT GCCGCGAGCA CCTGGCCCGA
GACTTTCCCG GGGTCGAACT GGTGGAGAGC ACCAGTACCG CCGAGGCGGC ACGAGCGGTG
GCGCAAACCG GCCGGCCGTG GGCGGCGGTG GGCACCCGGA AAGCGGCTCG GCTGCACGGC
CTGTCGGTGT TGGCGGAGGA CGTGGCCGAC CTCAAGGAGA ACGCCACCCG CTTCCTGGTG
ATCGGGCGGC GGGGCTGCCG GACCGGGCCG GGCGACAAGA CCACGGTCCT GGTCGCGGTC
GATGGCCGCC GTCCTGGTTC CCTGTACCGC CTGCTGGGCG AATTCGCGCG CCGGGGCATC
AACCTGACGC GCATTGAATC GCGGCCGGCC AAGACCCGGC TGGGGGAATA CATTTTCTTC
ATCGATCTGG AGGGACATCC GGGTGAACCC GAGGTTGACG AAGCTCTGGC TGGCGTGCGG
GCGAGAAGCA GTTTTTGCAA AATCCTGGGA TCCTACCCGG CGGACGGTGC TTCTCAGACG
CCGCGGGACC CGGTGTCGTC CGACCTGGAG ACGATCCGGG CCGAAATCGA CGTGACCGAC
AGCCAGATTG TGGCCCTCTT GGCCGAGCGG GCCGAACTGG CGCGCCGGGC CGGGAAATTC
AAGGACGGGA GACCGGTGCG CGACCCGGAA CGGGAAGCGG AGATCAAGGA ACGGCTGCGG
GCGCTGGCCG TGAGGAAGGG ACTCGATGCC GACATAGTCA CCGGAGTCTA TGAGTTGCTG
CTGCCTTATT TCGTCGAGTT GCAGGGTGGC CCCGGCTAG
 
Protein sequence
METIAYLGPE GTHSEEAASR WAGDRPMLLR PLRSLVEVVG AVEGGSVDWG LLPAENSGEG 
SLGLTLDLLA HQADRVQICG EVVLRIRHHL LARPGVSRER VTRIISHSQA LAQCREHLAR
DFPGVELVES TSTAEAARAV AQTGRPWAAV GTRKAARLHG LSVLAEDVAD LKENATRFLV
IGRRGCRTGP GDKTTVLVAV DGRRPGSLYR LLGEFARRGI NLTRIESRPA KTRLGEYIFF
IDLEGHPGEP EVDEALAGVR ARSSFCKILG SYPADGASQT PRDPVSSDLE TIRAEIDVTD
SQIVALLAER AELARRAGKF KDGRPVRDPE REAEIKERLR ALAVRKGLDA DIVTGVYELL
LPYFVELQGG PG