Gene Information |
Locus tag | Dtox_3481 |
Symbol | |
ID | 8430476 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfotomaculum acetoxidans DSM 771 |
Kingdom | Bacteria |
Replicon accession | NC_013216 |
Strand | - |
Start bp | 3678702 |
End bp | 3680147 |
Gene Length | 1446 bp |
Protein Length | 481 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 645035705 |
Product | hypothetical protein |
Protein accession | YP_003192823 |
Protein GI | 258516601 |
COG category | [R] General function prediction only |
COG ID | [COG1524] Uncharacterized proteins of the AP superfamily |
TIGRFAM ID | |
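
The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information table above.
start_bp = 3678702
end_bp = 3680147

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
assert gene_length % 3 == 0

# Assumption: the last codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1446
print(protein_length)  # 481
```

Both results agree with the listed Gene Length (1446 bp) and Protein Length (481 aa).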
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 56 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGGGATTGC TGCTGTTGGT AATCGGGGCG GCTGCTTTAA GCTTTATATA TAAGGCTCCC TGTTACATAC CGCCATTAAG GATAATTGGT GATGTTTCCA ACAGCTATTG TCTGCAAAGC CCAAACGAAA TCGGTAAGCT GGAACAGATT AGTTTTCAGG GAACCAAATA CAAGGCCATT AAACTGTCGG ATATCATCAG TAAGGCTGAA CCGGTGGCCA ATCCAAGTCA GCTGTATTTA GCCGGTCTGG ATGGGTTTAC TCCGGCTATC AAAGCAGCAG AGATTGAAGA TTGCTATATT TCTTTTACTC ACCAAAATGG CTGGGAAGCT GTCAATTTGA AACACCCAGT GAGCAGCAAT ACCAAAATGC TTACAGAAAT CGTGGTTGTA TCGGATGGCA GTTCCGGGGA TTTTGCCTTA AATGTTATAG ACACGGAAAA CAACCTGGTG CGGGTTACAC CGGGGCAGTT GCTGAGCAGG CCGTTAACGC GGTATTTTTA CCCGGAGGGG AGGGCCGCCG TTCAAAACAA TGGTAAAGAT TATGAATCTC AGGTTTACAC CAAGAGGCTG GTTTTTAAAT TAAGTGACGT AACGCCGGTC AAAGAAGGAG ATAACCTCCT GGTAATGACG GAAAAGGGAA AATACCGTAT GGTGGATAAC AGCGGTTATT TTGAAGTAAG AGATAATAAT ATCAGTTACC TGCAGCCAGA GGACCGGACT ATTCTGGAAC AAGTGCGGGG AGTAATCCTT CGCCCGCCCG CCGCCAGCAT CATGGACACT TATTATGATG CACGGCATTA TCTTGAGGGA GGCGACAGGT TGCTGGTGCT AGTGCTGGAC GGGCTTAACT ACAATCAGTA CAGCTATGCT GCGGCTAACG GCTATATGCC GTTTCTAAAA AGATACGGGA CTGCCGTAAA GGCTTCCGGT GTTTATCCGC CTGCGTCCAA TGTGGGTTTG GCTGCTCTGT TAACCGGTCA AGCTCCTGAA GAAAATGGTA TTGTAAGCGA AAAAGACCGG CAGTTAAAAG CATCATCGAT TTTTGCGGAG GCGAACAGGT TGAGTAAAAA GGTACTGTTT TTGGAGGCTG CTCCAAACCG GCTTGATACT GAAATACAAC CACTGCCCGT TACCGATCGG AATTCCGATG GCAATACTGA TGATGACCTG TATGAAACGG CCTTGGCTAA TCTGGATAAG GGCTATGATT TAATAATGGT TCGCTTTCAT GATATTGATG AAACCGGGCA GCGTTACGGT GAAATAGCCA GGCCGACGAT GCAGGCAATT AGTTCGTTAG ATAATTATCT GTCTAAAATT ATCAGTAAAT GGTCCGGTAA AGTTATAATT ACTGCCAATC AGGGAAGCAT GTCCGGTAAA TTAGTCGGAG CGGAAGCGAT ATTCAGCAAT AATAATATGT TTGTGCCTTA CTGGCGTATT CCATAA
|
Protein sequence | MGLLLLVIGA AALSFIYKAP CYIPPLRIIG DVSNSYCLQS PNEIGKLEQI SFQGTKYKAI KLSDIISKAE PVANPSQLYL AGLDGFTPAI KAAEIEDCYI SFTHQNGWEA VNLKHPVSSN TKMLTEIVVV SDGSSGDFAL NVIDTENNLV RVTPGQLLSR PLTRYFYPEG RAAVQNNGKD YESQVYTKRL VFKLSDVTPV KEGDNLLVMT EKGKYRMVDN SGYFEVRDNN ISYLQPEDRT ILEQVRGVIL RPPAASIMDT YYDARHYLEG GDRLLVLVLD GLNYNQYSYA AANGYMPFLK RYGTAVKASG VYPPASNVGL AALLTGQAPE ENGIVSEKDR QLKASSIFAE ANRLSKKVLF LEAAPNRLDT EIQPLPVTDR NSDGNTDDDL YETALANLDK GYDLIMVRFH DIDETGQRYG EIARPTMQAI SSLDNYLSKI ISKWSGKVII TANQGSMSGK LVGAEAIFSN NNMFVPYWRI P
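
The gene sequence begins with TTG rather than ATG, yet the protein sequence begins with M. This is expected under translation table 11, where TTG is an accepted initiation codon and is read as methionine when it starts a CDS. The following is a minimal sketch of that translation step, not the pipeline used to produce the record. The helper `translate_cds` and the constant names are hypothetical. The codon table is built from the standard NCBI base ordering (TCAG), whose amino acid assignments table 11 shares.

```python
from itertools import product

# Standard amino acid assignments in NCBI order (bases T, C, A, G).
# Table 11 uses the same assignments; it differs in its start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

# Initiation codons listed for translation table 11.
STARTS_11 = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(seq: str) -> str:
    """Translate a CDS (hypothetical helper); the first codon is read as M."""
    # Remove the spaces that split the sequence into 10-base blocks.
    seq = "".join(seq.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    if codons[0] not in STARTS_11:
        raise ValueError(f"{codons[0]} is not a table 11 start codon")
    protein = "M" + "".join(CODON_TABLE[c] for c in codons[1:])
    # Drop a trailing stop codon symbol if the input includes one.
    return protein.rstrip("*")


# First three 10-base blocks of the gene sequence above.
head = "TTGGGATTGC TGCTGTTGGT AATCGGGGCG"
print(translate_cds(head))  # MGLLLLVIGA
```

The output matches the first block of the protein sequence. Passing the complete gene sequence through the same function would be one way to check the remaining residues.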