Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dtox_0987 |
Symbol | |
ID | 8427926 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfotomaculum acetoxidans DSM 771 |
Kingdom | Bacteria |
Replicon accession | NC_013216 |
Strand | - |
Start bp | 1010072 |
End bp | 1011307 |
Gene Length | 1236 bp |
Protein Length | 411 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 645033325 |
Product | protein of unknown function DUF224 cysteine-rich region domain protein |
Protein accession | YP_003190499 |
Protein GI | 258514277 |
COG category | [C] Energy production and conversion |
COG ID | [COG0247] Fe-S oxidoreductase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.965573 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.00212312 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGCTAAGTC AACCGGATCA ATTAAAAGAT ATACAGAGTC AATTTCAAAA ATGCGTGCGC TGCGGGCTCT GCCGATCGGT CTGTCCTATT TTTAAAGAAG ACCGCAGGGA AACCGCAGCT CCCAGAGGCA AAGTATTTTT AGCACAAATG CTGGCAGAAG GAGAAATTAC ACCTGACAGC AAAGCCGCCC AAAACCTTTC CATGTGCCTG ATGTGTGAAT CCTGTTCCAG CGAGTGCCCT TCGGGAATCG AAGTGCACAA AATAGTCAGC CTGGCTCGAT CCATGGTTAA TGAAAACAAT CCTTCATTAA CAAACAAGGC AAATAAATTG ATCTTCAAAG ATTTATGGAG CAAACCTTCT TTAATGAATT TAAGTTTCAA TCTAATTAAA ACCGGTCAAG CACTCGGACT GCTGGATTTC GGCACCAAAT CAGGACTGCT GCCAAAATCA GGACGCCTGC TGGGTGAACT GCCAGGGAAA CCGGCACGAC AGGCACTGCC GGAAATAGTA CCCCCCACAA CCAGACAAAA GGCGCGCACC GGGTATTTTT TGGGCTGTGC CACCAACTAC CTCTATCCTC AAGTGGCTTT CAGTACAGTA AAAATACTGT CACACCTTGG CTGTGAAGTA GTAATACCTC GTGAACTAAC CTGCTGCGGC TTGCCTCAGC TGGCTAACGG CGAACCTGCT GCCGGACACA ATTTAGCCAG GCAAAACTTG CAAATCTTTA AACGGGCCGG GGTTGAAGCA GTAGTCTGTG ACTGTGCTTC TTGCAGTGCT ACCTTAGCGG AAAACTGGGG ACAAGCTCTA CCGGTATATG ACGCCGTAAA ATATATTATA CAGGAATTAA AGCTGGATTT GTCAGATAAA AAACAAATTA ATAATCAACC AATTAAAATA GTAACCTACC ATGATCCCTG CCACCTAGCC AAAGCACAAA GAATCAGGCA GCAGCCACGA CAATTACTGC AGATGCTGCC GGGCGTGGAA TACAGGGAAA TGCCCGGGGC CGATAACTGC TGCGGCGGTG CCGGCACTTT CGTCGTGAAA AACTATGATC TGAGCATGCG TATTCTGGAT CGAAAAATCG CATCCATCAA AGAAACCGGT GCTGACATTG TAGCCACCTG CTGTCCTACC TGTACTATGC AGCTTAAACA CGGTTTGGAT AAGCACGGAC TTCAAATTGA AGTAAAACAC CCACTGGAAC TCCTGGCCGA GACACTCGGG CTATAG
|
Protein sequence | MLSQPDQLKD IQSQFQKCVR CGLCRSVCPI FKEDRRETAA PRGKVFLAQM LAEGEITPDS KAAQNLSMCL MCESCSSECP SGIEVHKIVS LARSMVNENN PSLTNKANKL IFKDLWSKPS LMNLSFNLIK TGQALGLLDF GTKSGLLPKS GRLLGELPGK PARQALPEIV PPTTRQKART GYFLGCATNY LYPQVAFSTV KILSHLGCEV VIPRELTCCG LPQLANGEPA AGHNLARQNL QIFKRAGVEA VVCDCASCSA TLAENWGQAL PVYDAVKYII QELKLDLSDK KQINNQPIKI VTYHDPCHLA KAQRIRQQPR QLLQMLPGVE YREMPGADNC CGGAGTFVVK NYDLSMRILD RKIASIKETG ADIVATCCPT CTMQLKHGLD KHGLQIEVKH PLELLAETLG L
|
| |