Gene Daud_1041 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaud_1041 
Symbol 
ID6027582 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Desulforudis audaxviator MP104C 
KingdomBacteria 
Replicon accessionNC_010424 
Strand
Start bp1094025 
End bp1094981 
Gene Length957 bp 
Protein Length318 aa 
Translation table11 
GC content61% 
IMG OID641593853 
Producthydrogenase (NiFe) small subunit HydA 
Protein accessionYP_001717185 
Protein GI169831203 
COG category[C] Energy production and conversion 
COG ID[COG1740] Ni,Fe-hydrogenase I small subunit 
TIGRFAM ID[TIGR00391] hydrogenase (NiFe) small subunit (hydA)
[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.225928 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGCGCAAGC TTACCAGGCG GGAATTCGTA AAAATGTGCG GTATGTCCGC CGCCGGGCTG 
AGCCTGATGT CGCTGCTGGG TCCCCAGATC ACCCACGCAC TGGCCAAGGC GGTCGAGGAC
AAGGTGCCCG TCGTCTGGAT TCAGGGTGCG AGCTGCACCG GATGTTCGGT TTCGCTCCTG
AATGCCGTGG ATCCGTCCAT TGAGAAAGTG CTGCTCGAGG TTATCAGCCT GCGCTACCAC
CCGAACATAA TGGCCGCTTC CGGGTATCTG GGCACCGCGG TCATCGAGGA TGTGGCCGCC
CGGTTCGCCG GGGAGTTCAT CCTGGTCGTC GAAGGCGGCA TACCGGTGAA CGAGAAGGGC
AAGTATTGCG TAATCGGCAA ATTCGGTAAG AAAGAAATGA CCGCCCACGA GGCTCTGCTG
ACCCTGGGCG CCAAAGCCAA GGCGGTCGTG GCCGCCGGTC AATGTGCCGC CTTCGGCGGG
ATCCCGGCAG GGGCTCCGAA CCCGACCGGT GTGTTGGGCG TTGACGCGGT GCTCAATCCG
ATGCGCTATC GCCGGCCGCT GGCTAAAAAC GTGATCAATA TTTCCAACTG CCCGCTGCAT
CCGGACCACT TCCTCGGCAC CCTCACCTAT GTGTTGACCT ATAACGAAAT CCCCGAACTT
GACCGTTACG GGCGCCCGGT GATGTTTTAC GGGCAGTCCA TTCACGACAA CTGCCCCCGG
CGGCCTGACT TTGAAGCCGG CCGTTTCGCC GCCGTAATCG GGGACGAGGG CTGTCTGGCA
GGCCTGGGCT GCAAGGGGTT TATTGCTATG TCGGATTGTC CGCGACGGGG CTGGAACAGC
GGGACAAACT GGTGCATCGC GGCCGGGGCG CCGTGTTATG CTTGTTCGGA GCAGATCTTT
CCGGACGGGT GTTCCCCGAT TTACGGTGCG ATGCCGGTAA CCGGGAACGG CAGATAA
 
Protein sequence
MRKLTRREFV KMCGMSAAGL SLMSLLGPQI THALAKAVED KVPVVWIQGA SCTGCSVSLL 
NAVDPSIEKV LLEVISLRYH PNIMAASGYL GTAVIEDVAA RFAGEFILVV EGGIPVNEKG
KYCVIGKFGK KEMTAHEALL TLGAKAKAVV AAGQCAAFGG IPAGAPNPTG VLGVDAVLNP
MRYRRPLAKN VINISNCPLH PDHFLGTLTY VLTYNEIPEL DRYGRPVMFY GQSIHDNCPR
RPDFEAGRFA AVIGDEGCLA GLGCKGFIAM SDCPRRGWNS GTNWCIAAGA PCYACSEQIF
PDGCSPIYGA MPVTGNGR