Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dtox_4156 |
Symbol | |
ID | 8431170 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfotomaculum acetoxidans DSM 771 |
Kingdom | Bacteria |
Replicon accession | NC_013216 |
Strand | + |
Start bp | 4331509 |
End bp | 4332609 |
Gene Length | 1101 bp |
Protein Length | 366 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 645036349 |
Product | putative transmembrane anti-sigma factor |
Protein accession | YP_003193447 |
Protein GI | 258517225 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0909715 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.000583924 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGACTGCC AAAAAACAAT GAATTTATTA TCACTCTACG TAGACGGCAG TCTCGACAAT AAACCAGACG ACCGTGCTAT AAAAGATCAC CTGTCGGCTT GCAAAGCCTG CAGCAGCGAG TTTGCTTTGC AAAAAAGGCT TTCCACCGCT ATGAATAGTT TTAAGAGTGA GGATATAACA GCACCGCCTG ATTTATGCGC CAATATCATG GGGCAACTAA AACAGGAGCG CAAAAAAGTT TTCCACCTGC TTCCTGCCGC CTGGCGCAGA ACAATTGCCG CGGTTGCCGC AATACTGCTT ATGGCAGGCA TGTCCTCAGG GATTACAAGC AGCTTGCTGC CGGTAGCCAA CAATGATAAG CCGAATGCGG CGCGGCCCTC ACAAGTTGCC TCAACTGATA ACGGCGCTGC GGCAGAAGTT AAACCTGAGA CGCATGATCC AAACCGGTCC AAAGATGCCG AGCAACAGCC GGACAGTAAC GCAAATGAAT CAAAAGTTGA TGTTTCATCT AAAGAAACCA CGTCAGGCAA CGATGTCAAA AAGAACGGCA CGGCTATGAC AAACACGGAA AGCAACACCG GAGAGGTTCC GTCAGGCACA ACCAAGAAGC CTACAGAAGT TGGTCAAAGT TCCCCTTCTG TTAAAGCAAC ACCTTCATAT GCAGAGAAAA CCGCATTTCT CAGTAAAAAT ATGGTGATAA CCAGTACCGT CTTAAAAATC TCGGTAAATG ACTTGTCCGA AGCCAAGATA AAAGCAGTAG CCTTGGCCGC CGGCGCAGGA GCCTCAAACC AGCTGTTTCC CGAGGAAGGC GGCTTGCTCA TGAGATTGGC TACTCCGGCT GAGCAGGCAC AACAGCTCAT CAACGGGCTG TCCGGATTAG GCACGACGAT GGACAGACAA GATGAAAACA GGGACATAAC TTCTTCTTAC AACAAAGCTT CTGTACAATA TGCCGAACTG CAAGCCAGAA TAAGTGCATC GACTGATACA GAAGAGCGCA GGCAATTAGA AAACCAGGCG GCAGGTTTTA AGAGGACTAT GGATTCATAT GAAGCTGATG CCGGTAAGAG GGTAATAGTT TTATGGATAG AAAAAAAATA G
|
Protein sequence | MDCQKTMNLL SLYVDGSLDN KPDDRAIKDH LSACKACSSE FALQKRLSTA MNSFKSEDIT APPDLCANIM GQLKQERKKV FHLLPAAWRR TIAAVAAILL MAGMSSGITS SLLPVANNDK PNAARPSQVA STDNGAAAEV KPETHDPNRS KDAEQQPDSN ANESKVDVSS KETTSGNDVK KNGTAMTNTE SNTGEVPSGT TKKPTEVGQS SPSVKATPSY AEKTAFLSKN MVITSTVLKI SVNDLSEAKI KAVALAAGAG ASNQLFPEEG GLLMRLATPA EQAQQLINGL SGLGTTMDRQ DENRDITSSY NKASVQYAEL QARISASTDT EERRQLENQA AGFKRTMDSY EADAGKRVIV LWIEKK
|
| |