Gene Daud_1183 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaud_1183 
Symbol 
ID6027523 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Desulforudis audaxviator MP104C 
KingdomBacteria 
Replicon accessionNC_010424 
Strand
Start bp1238479 
End bp1239492 
Gene Length1014 bp 
Protein Length337 aa 
Translation table11 
GC content62% 
IMG OID641593998 
Productphospho-2-dehydro-3-deoxyheptonate aldolase 
Protein accessionYP_001717326 
Protein GI169831344 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2876] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase 
TIGRFAM ID[TIGR01361] phospho-2-dehydro-3-deoxyheptonate aldolase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.337643 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTCATTG TAATGCACCA TAAGGCGAGC GAAGCTGAGA TTGAGGCAGT CGTGAAGAGA 
ATAGAGTCGG CCGGCTACCG GGCCCACCTG TCCCGCGGGG TGGAGCGGAC CATTATCGGG
GCCATCGGCG ATGAGACCCT CCTCGGGGAT GCCGGGATTG AACTCCTGCC GGGGGTGGAC
AAGGTTATTC CGATCATGGC CCCCTACAAG CTGGCCAGCC GGGTGATGAA AGCAGAAGGC
ACGGTGATCA CCGTCGGGGA CGTGACCATC GGGGGCGACA CCATCCAGGT GATGGCCGGC
CCATGTGCGG TGGAGAGCAA GGAACAGCTG TTCGAGGTGG CGGAAAAAGT AAGGGCTGCC
GGGGCCCGGA TTCTACGGGG CGGCGCCTAC AAGCCCCGCA CTTCTCCGTA TTCGTTCCAG
GGACTGGCCG AGAAGGGGCT GCAACTCCTG GCCGAGACCC GGGAGCGGTA TGGTCTTCTG
ATTGTGACCG AGGTAATGGA CGTCCGGACT CTGCCGCTGG TGGCCGAGTA CGCGGACATC
ATCCAGATCG GCACCCGGAA CATGCAGAAC TTCTACCTGC TGCGCGAGGT CGGCCGGTAC
AGCAAACCGG TTCTGCTGAA ACGTGGCCTG TCGGCCACCA TCGAAGAATG GCTGATGGCG
GCCGAGTACA TCCTGAACGA GGGGAACCAG AACGTAATCC TGTGCGAACG CGGGATCCGC
AGTTTTGAAA CCTTTACCCG GAACACGCTG GATCTTTCGG CTGTGCCGAT CGTAAAGTAT
CTCTCCCACC TGCCGGTGGT GGTGGACCCC AGTCACGGCA TCGGTAAGTA CCGGTTTGTG
CCGCCGATGG CCCTCGCCGC GGTGGCCGCC GGGGCCGACG GCCTCTTGAT CGAAGTTCAC
CCCAACCCGG CGGAGGCCTT GTGTGACGGG GCGCAGTCCC TGACCCCGAA GAAGTTCGGG
AAGACTATGG TTCAACTGGC GCAGATCGCA CAGGCGGTCG GCCGGAGAGT TTAG
 
Protein sequence
MVIVMHHKAS EAEIEAVVKR IESAGYRAHL SRGVERTIIG AIGDETLLGD AGIELLPGVD 
KVIPIMAPYK LASRVMKAEG TVITVGDVTI GGDTIQVMAG PCAVESKEQL FEVAEKVRAA
GARILRGGAY KPRTSPYSFQ GLAEKGLQLL AETRERYGLL IVTEVMDVRT LPLVAEYADI
IQIGTRNMQN FYLLREVGRY SKPVLLKRGL SATIEEWLMA AEYILNEGNQ NVILCERGIR
SFETFTRNTL DLSAVPIVKY LSHLPVVVDP SHGIGKYRFV PPMALAAVAA GADGLLIEVH
PNPAEALCDG AQSLTPKKFG KTMVQLAQIA QAVGRRV