Gene Daud_1905 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaud_1905 
Symbol 
ID6026579 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Desulforudis audaxviator MP104C 
KingdomBacteria 
Replicon accessionNC_010424 
Strand
Start bp2008203 
End bp2009168 
Gene Length966 bp 
Protein Length321 aa 
Translation table11 
GC content68% 
IMG OID641594723 
Productphospho-2-dehydro-3-deoxyheptonate aldolase 
Protein accessionYP_001718030 
Protein GI169832048 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2876] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase 
TIGRFAM ID[TIGR01361] phospho-2-dehydro-3-deoxyheptonate aldolase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000989791 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGTACCGC GGAAGCCCGG CTTTCGTCCC GGACGAAGGC CGGGCTTTTT TATTCTTAAT 
ACAAAGGGGA TGCTTGAGAT GATGGGGATG GAATTGGTAG AACCGCAAGC GGTGCCCGGG
GCGGTCCGGG CCTGCGCGGC GGGGCGGCCG TACCGGCTGG CCGGCAGGGG CCACGGCGGC
GCAAACACGG TGGTCCGGGT GGGCCGGGCG GAGTTCGGCT CTGGCGCGGT GAACGTAATC
GCCGGCCCGT GCGCCGTGGA GAGCCGGGAA CAAATGACGG CCGCGGCCCG GGCGGCGGCC
GGGTCCGGGG CGAAGGTCCT GCGGGGCGGC GCCTACAAGC CCCGGACTTC TCCCTACAGT
TTCCAGGGCC TGGAGCGCGA GGGGCTCGAG TTGTTGGCCG AGGCCGCGGC GGCGGCGGGG
TTGGCCAGCG TGACCGAAGT GATCGACGAG GAGAGCCTGG CGGCGGCGGT AGAGTACGTG
GATATGCTCC AGGTCGGTTC GCGGAACATG CAGAACTTCC ACCTGCTGCG GGCGGTGGGG
CGGGCGAACA AGCCGGTGCT TTTGAAGCGC GGGTTCTCCG CCACGATCGA GGAGTGGCTG
ATGGCGGCCG AGTACATCCT GGCCGGGGGA AATACCCAGG TGGTGCTGTG CGAGCGGGGC
ATCCGTACTT TTGAGACCTA CACCCGGAAT ACGCTGGACT TGAGCGCCGT CTCCCTGGTG
AAAAAACTGA GCCACCTGCC GGTGATCGTC GACCCGAGTC ACGCCACCGG CAGGGCGGAA
CTGGTCGCTC CGATGTCCCT GGCGGCGGTG GCGGCCGGGG CGGACGGGAT CATCGTCGAG
ATGCACCCGG AGCCCGAAAA AGCCCTCTGT GACGGCAAGC AGTCCCTGGA CCCGGCCGCC
TTTGACCGGC TGATGCGGGA AGTGGACATC ATCGCCCGGG CGTTGAACCG GGGCGTTGTG
GATTGA
 
Protein sequence
MVPRKPGFRP GRRPGFFILN TKGMLEMMGM ELVEPQAVPG AVRACAAGRP YRLAGRGHGG 
ANTVVRVGRA EFGSGAVNVI AGPCAVESRE QMTAAARAAA GSGAKVLRGG AYKPRTSPYS
FQGLEREGLE LLAEAAAAAG LASVTEVIDE ESLAAAVEYV DMLQVGSRNM QNFHLLRAVG
RANKPVLLKR GFSATIEEWL MAAEYILAGG NTQVVLCERG IRTFETYTRN TLDLSAVSLV
KKLSHLPVIV DPSHATGRAE LVAPMSLAAV AAGADGIIVE MHPEPEKALC DGKQSLDPAA
FDRLMREVDI IARALNRGVV D