Gene Daud_2004 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaud_2004 
Symbol 
ID6026520 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Desulforudis audaxviator MP104C 
KingdomBacteria 
Replicon accessionNC_010424 
Strand
Start bp2111307 
End bp2112602 
Gene Length1296 bp 
Protein Length431 aa 
Translation table11 
GC content57% 
IMG OID641594826 
Producthypothetical protein 
Protein accessionYP_001718127 
Protein GI169832145 
COG category[R] General function prediction only 
COG ID[COG1253] Hemolysins and related proteins containing CBS domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGAACAAC TGCTTCCTCA AATCGGGCTG ATACTCTTAC TTATTTTGCT CAACGCGGTG 
TTTGCCTCCG CGGAAATCGC CCTGGTCTCC GTCCGCCGTT CGCGCATTGA CGTCCTCGCC
AAAAAGGGGG ACCGCCGCGC CGCTGCCGTG GCTCGACTCC TCAAAGGGGA TCCGGGCCGC
TACTTGGCGG CTATCCAGAT CGGCGTAACC CTGGCCGGGT TTCTGGCCAG CGCCACCGCC
GCCGTCACCC TGGCCGTGCC GCTGCAGAAC ATCTTTCAAA ACGTACCCGT AGCCGCAGTC
AACACTAATG CGCAGGGAAT CGCCGTGGTA ATCACCACCA CGCTGATCGC GTTAATCACC
CTCATCTACG GCGAGTTGGT CCCTAAACGG GTGGCGTTGC AGGCAACTGA ACGTGTCGCC
CTCCTTCTGG GCCGACCCAT TCACTTGTTC TCGCGCGCAA CACGCCCGGT AATCCTTCTG
CTGACCGCAG CCACCAACTA CTCGCTGCGC CTGTTCGGAT TAAAGCCCGG TGTTAACGAA
GACCAGGTGA CCGAAGACGA ACTCAAACAA ATCATTGTAA ACCAAAGCAC CCTGGACCGG
GAAGAGCAAC GGCTTCTTTG GGACGTCTTC GACTTCGGAG ACGCGGTGGC TTATGATGTA
ATGGTCCCGC GCACCGACGT GGTAGGGGTC GAAACCAGCA CTTCTGTGGC GGACACCCTT
CGTCTGATGT CGGAAACAGG CCATTCCCGC ATTCCGGTCT ATGGGCAGAA CCTTGACGAC
ATCAAAGGTA TTGCCGGGAT CAAGGACCTG GTCCCTTATC TCTTGCGCGG GGAGGAGCAG
GCGCCGGTAG AGAAGGTGGT TCGCCCGGCC TACGTTGTCC CGAATACTGT TCCGATCAGG
CAGTTGCTCC GTGACCTACA GAAGCGCGGG GTGTCAATGG CCGTGATCGT AGATGAATTC
GGCGGGACTG ACGGTGTTGT CACCGTGGAG ACTCTGCTCG AAGAGCTTGT AGGAGAAATC
CGCGACGAGT ACGACCGGGA GGACCAAGAA ATCTTATCTT CAGAAGACGG GCAAGCGATC
GTCAAGGGTT CGGCTGGAGT GGATGAGGTC AACCGCCAAC TAAAACTGGC GATCCCAGAG
AGCGAGGAAT ACCATACGAT CGCCGGTTTC ATCCTCGATC AGCTCAACAA GGTGCCAAAA
GCCGGGGACC GTGTGACTTT AGACGGTACC GTACTTGAGG TCGCAAAAAT GAAGGCGAAC
CGCATCTTGA TGGTTTCGAT CAAAAAAGAA GATTGA
 
Protein sequence
MEQLLPQIGL ILLLILLNAV FASAEIALVS VRRSRIDVLA KKGDRRAAAV ARLLKGDPGR 
YLAAIQIGVT LAGFLASATA AVTLAVPLQN IFQNVPVAAV NTNAQGIAVV ITTTLIALIT
LIYGELVPKR VALQATERVA LLLGRPIHLF SRATRPVILL LTAATNYSLR LFGLKPGVNE
DQVTEDELKQ IIVNQSTLDR EEQRLLWDVF DFGDAVAYDV MVPRTDVVGV ETSTSVADTL
RLMSETGHSR IPVYGQNLDD IKGIAGIKDL VPYLLRGEEQ APVEKVVRPA YVVPNTVPIR
QLLRDLQKRG VSMAVIVDEF GGTDGVVTVE TLLEELVGEI RDEYDREDQE ILSSEDGQAI
VKGSAGVDEV NRQLKLAIPE SEEYHTIAGF ILDQLNKVPK AGDRVTLDGT VLEVAKMKAN
RILMVSIKKE D