Gene Information |
Locus tag | Dtox_1961 |
Symbol | |
ID | 8428943 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfotomaculum acetoxidans DSM 771 |
Kingdom | Bacteria |
Replicon accession | NC_013216 |
Strand | + |
Start bp | 2084156 |
End bp | 2089621 |
Gene Length | 5466 bp |
Protein Length | 1821 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 645034289 |
Product | Ig domain protein group 2 domain protein |
Protein accession | YP_003191420 |
Protein GI | 258515198 |
COG category | [N] Cell motility |
COG ID | [COG5492] Bacterial surface proteins containing Ig-like domains |
TIGRFAM ID | |
| |
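The coordinate and length fields above are mutually consistent, which a little arithmetic can confirm. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length. The variable names are illustrative only and are not part of the record.

```python
# Values copied from the Gene Information table above.
start_bp = 2084156
end_bp = 2089621
gene_length_bp = 5466
protein_length_aa = 1821

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# A CDS should be a whole number of codons (3 bases each).
codons, leftover = divmod(span_bp, 3)

# Assumption: the last codon is a stop codon and is not translated.
expected_protein_aa = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", leftover == 0)
print("protein length matches:", expected_protein_aa == protein_length_aa)
```

All three checks hold for the values listed in this record.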
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0983337 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.347349 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATAATC TATTCATACC AATGTATAAA ATCCTAGTGA GGGTTCTGCT CATTATCGTT GTGGCATCAG GCTTTCTATT TAACTCTCTT GACCTGCAGG GAACGAAAGT CGAAGCGTCA ACAAGCGGTG CAACTACATT GAGTATTGAG CCGGTGACAA AGAATGTCAG CGCCGGTGAC ACCTTCACCC TGAATGTGTT GGTTGCCCCG GCTACAGCAA TAGCCGGGGC GCAATTCAAC TTGAGCTACG ACCCGGCAGT GCTGCAAATT AACTCAGTAA CCGAGGGTGG ATTGCTCAAG CAGAACGGCA ATACTTCCTT CTTTCTGCCT GGCGACATTG ATAATACCGA AGGGTTGCTT AAGAATGTTG CCGGAGCTAT CACTACCTCC GGCGGCGAAG TGAGCGAGGA GGGAGTCTTG GCCACCGTTA CTTTTACAGC TAAGGCCTCC GGTACATCCA CCCTCACTCT GAGCAATGTC ATAGCCGGCA GTAAGGTGGG TCAGTCCGTA CCGGTACAGG TTATTGGCGG CAGCGTCACT GTACAGGGAG GAACAAGCAG CGAATCGGTT AGCGGAGTCA GCTTGGATCA AACCAGCTGC AGCCTGACCG TGGGAGAAAC CGGCCAGCTA ACCGCCACTG TCCAGCCGGC AAATGCCAGC AACAAAAATG TTACCTGGAC ATCAGACAAT GAAGCAGTGG CCACAGTAGA CGCCACCGGC AAAGTAACGG CAGTATCCGC CGGCACGGCC AATATCACCG TGACCACGGC AGACGGGGGC TTCACCGCCA CCTGCGCAGT CACTGTACAG GGAGGAACAA GCAGCGAATC GGTTAGCGGA GTCAGCCTGG ACAAAACCAG CTGCAGCCTG ACCGTGGGAG AAACCGGCCA GCTAACCGCC ACTGTCCAGC CAGCTAACGC CAGCAACAAA GACGTTATCT GGAGTTCAGA CAACGAAGCA GTGGCCACAG TAGACGCCAC CGGCAAAGTA ACGGCAGTAT CCGCCGGCAC GGCCAATATC ACCGTGACCA CGGCAGACGG GGGCTTCACC GCCACCTGCG CAGTTACAAT TACAACCGGC AGTGCCACTA TAGTGGGCAT CGACCCGGCG ACGAAGACTG TCAGTGCCGG TGACACCTTT GACCTGGATG TTCTGATTAC CCCGGCCACA GCGATAGCCG GAGCGCAGTT CAATCTGAGT TACGACCCGG CAGTGCTGCA GGTTAACTCA GTAACCGAAG GCGGATTGCT CAAACAAAAC GGCAACACTT CCTTCTTCCT GACCGGCGTC ATTGATAATA ACAGCGGGCT GCTCAATAAT GTTGCCGGAG CTATTACTAC TTCCGGCGGA GAAGTGAGCG GAGCGGGATC TCTGGCTGTC ATCTCATTTA CGGCCAAGGC CACAGGTACG TCCACCCTTA CTTTGAGCAA TGTCATAGCC GCCAATAAAG CGGCCCAAGC CGTACCGGTA CAGGTTAACG GCGGCAGCGT CACTGTACAG GGAGGAACAA GCAGCGAATC GGTTAGCGGA GTCAGCCTGG ACAAAACCAG CTACAGCCTG ACCGTGGGAG AAACCGGCCA GCTAACCGCC ACTGTAGCAC CGGCAAATGC CAGCAACAAA AATGTCACCT GGACATCAGA CAATGAAGCA GTGGCCACAG TAGACGCCAC CGGCAAAGTA ACGGCAGTAT CCGCCGGCAC GGCCGATATC ACCGTGACCA CGGCAGACGG GGGCTTCACC GCCACCTGCG CAGTCACAAT TGCAACCGGC AGCGTCACTA TAGTGGGCAT CGACCCGGCG 
ACGAAGACTG TCAGTGCCGG TGACACCTTC AACCTGGATG TTCTGATTAC CCCGGCCACA GCGATAGCCG GAGCGCAGTT CAATCTGAGT TACGACCCGG CAGTGCTGCA GGTTAACTCA GTAACCGAAG GCGGATTGCT TAAACAAAAC GGCAACACTT CCTTCTTCCT GACCGGCGTT ATTGATAATA ACAGCGGGCT GCTCAATAAT GTTGCCGGAG CCATTACTAC TCCCGGCGGA GAAGTGAGCG GAGAGGGATC TTTGGCTGTC ATCTCATTTA CGGCCAAGGC CACAGGTACG TCCACCCTTG CTTTGAGCAA TGTCATAGCC GCCAATAAAG CGGCCCAAGC CGTACCGGTA CAGGTTAACG GCGGCAGCGT CACCGTACAG GGAGGAACAA GCAGCGAACC GGTCAGCGGA GTCAGCCTGG ATCAAACCAG CTACAGCCTG ACCGTGGGAG AAACCGGCCA GCTAACCGCT ACTGTAGCAC CGGCAAATGC CAGCAACAAA AATGTTACCT GGACATCAGA CAATGAAGCA GTGGCCACAG TAGACGCCAC CGGCAAAGTA ACGGCAGTAT CCGCCGGCAC GGCCAATATC ACCGTGACCA CGGCAGACGG GGGCCTCACC GCCACCTGCG CAGTCACCGT ACAGGGAGGA ACAAGCAGCG AATCGGTCAG CGGAGTCAGC TTGGATCAAA CCAGCTGCAG CCTGACTGTG GGAGAAACCG GCCAGCTAAC CGCCACTGTA GCACCGGCAA ATGCCAGCAA CAAAAATGTC ACCTGGACAT CAGACAATGA AGCAGTGGCC ACAGTAGACG CCACCGGCAA AGTAACGGCA GTATCCGCCG GCACGGCCAA TATCACCGTG ACCACGGCAG ACGGGGGCTT CACCGCCACC TGCGCAGTCA CAATTGCAAT CGGCAGCGTC ACTATAGTGG GCATCGACCC GGCGACGAAG ACTGTCAGTG CCGGTGACAC CTTTGACCTG GATGTTCTGA TTACCCCGGC CACAGCGATA GCCGGAGCGC AGTTCAATCT GAGTTACGAC CCGGCAGTGC TGCAGGTTAA CTCAGTAACC GAAGGCGGAT TGCTTAAACA AAACGGCAAC ACTTCCTTCT TCCTGACCGG CGTTATTGAT AATAACAGCG GGCTGCTCAA TAATGTTGCC GGAGCCATTA CTACTCCCGG CGGAGAAGTG AGCGGAGAGG GATCTTTGGC TGTCATCTCA TTTACGGCCA AGGCCACAGG TACGTCCACC CTTGCTTTGA GCAATGTCAT AGCCGCCAAT AAAGCGGCCC AAGCCGTACC GGTACAGGTT AACGGCGGCA GCGTCACCGT ACAGGGAGGA ACAAGCAGCG AACCGGTCAG CGGAGTCAGC CTGGATCAAA CCAGCTACAG CCTGGCCGTG GGAGAAACCG GCCAGCTAAC CGCTACTGTA GCACCGGCAA ATGCCAGCAA CAAAAATGTT ACCTGGACAT CAGACAATGA AGCAGTGGCC ACAGTAGACG CCACCGGCAA AGTAACGGCA GTATCCGCCG GCACGGCCAA TATCACCGTG ACCACGGCAG ACGGGGGCCT CACCGCCACC TGCGCAGTCA CCGTACAGGG AGGAACAAGC AGCGAATCGG TTAGCGGAGT CAGCTTGGAT CAAGCCAGCT GCAGCCTGAC TGTGGGAGAA ACCGGCCAGC TAACCGCCAC TGTAGCACCG GCAAATGCCA GCAACAAAAA TGTTACCTGG ACATCAGACA ATGAAGCAGT GGCCACAGTA GACGCCACCG GCAAAGTAAC GGCAGTATCC GCCGGCACGG 
CCAATATCAC CGTGACCACG GCAGACGGGG GCTTTACCGC CACCTGCGCA GTCACTGTGC AGGAAGGAAC AAGCAGCAAG CCGGTTACCG GAGTCAGTCT GGATAAAACC AGCGACAGCC TGACTGTGGG AGGAACCGCT CAGCTAACTG CCGCTGTCCA GCCGGCTGAC GCCAGCAATA AAGACGTCAC CTGGAGTTCA GACAATGAAG CAGTAGCCAC AGTAGACGCC ACCGGCAAAG TGACGGCAGT ATCCGTTGGT ACGGCCAATA TTACCGTGAC CACGGCAGAC GGGGGCTTTA CCGACACCTG TGTAATTACT GTGCCGCAAG TTGTTAATCT TGATCCGGAT TCCAACAGTA TTGCTGTTAC TAACAATCCT ATGGAAGTCA ATGTTCCTAA AGATGTGGCT AATCCTCAGA TAACATTAAC ACCGTCTTCT GGCGGCAGCT TTACCATGCC GCAGGTTAAA GTTAAGGCCG ATACCTCTCT GGGAACAGTA TCAGTGGGAA TTTCCGCGGG AACGCAAATC ACTGGTCCGG CAGACTGGAA CGGAATCATA GCGCTTCCCC AAGTGTTGCT TAATAATGCT GTTACAGTTA CTCCAACCAG CGGTAAAACT GCTACAGTTG AAGCAGTGGT TGAAATTGGT GCAAGTAATG ATGTTGCGCT GACCTTTGAT CATGCTGTAA GAATGCTTAT ACCAGGCATG GCAGGCAAAC AGGCCGGTTA TGTCCGGGGG GGCGTCTTCC AGAAAATAGA GACCCTTTGT GAGCAGGATA GTCAGGAATG GGCTGATAGT AACCTTCCAG CAGAAGGCGC AGGAAAAATA GATGTTAATG GAGACTTAGT AATTTGGACC AAGCACTTCA CTAAATTCGT CACCTATACT GAAACCGCTG TTGGCGGTGG AGGCGGTGGC GGAGGCGGTG GAAGCAGTAC AACTACACCC AGTGTTCAGA CAAGTGAGGC AGCAGGCATT ACAGAAAGCT CCGCTACCCT GAACGGCAGC ATTACCTCCA GTGGCGGATC GGCCGTCACC TCATACGGTT TTGTTTGGGG TACCGATCAG AACAATCTGG ACAAAAAAGT GCAGGTAGGA ACAGACAATC ACAGCGGGTC TTTTCAGACA AATCTCAGCG GATTGACAGC CGGGATCACT TATTACTTTA AGGCCTATGC GGTTAACTCC AAAGGTACGA AGCAAGGTGC AATAAAGCAG TTTACCGCTG CTGCGGCTGC TCAACAGCCT TCCGTTGAAG TTACGTCGCC GAATGGCACA CCAAGCAATG TAAAGAGCTT CTCTGATGTT GCCAATAGCC ACTGGGCATA CAATGCTATT TCCCAGCTGA GTAAGAACGG CTTGGTCAAT GGCTATCCTG ATGGTACCTT CCGGCCCGAC AGGGAGATGA GCAGAGCGGA GTTTGTTACC GTTTTGAGCA AGGCGCTGCA ACTGCCGGAT TATAAGCCGG CGGCATCTAC CTTTGGTGAT GTTGCGGCTT CCGATTGGTA TGCCGGGGCG GTGGAAACTG TTGTTCATGC CGGTATTATC AGCGGTTATG GAAATGGCAG TTTTGGACCG AACGATTCGG TTACCAGAGA ACAGGCATCA GTCATTCTGG TTAAAGCTTT GGGCAAAAAC AATGAGGCTG CAGCCAAGAT GCTTGATTCG ACCGGATTTA CCGATGATAC GGCTATTTCC TCTTGGGCAC GGGGCTTCGT GGTGGCAGCG GAAGAGCAAA ATCTACTCAA AGGCTATCCG GACAGCACCT TCCGCCCGCA GCAAAATATG ACTCGGGCAG AGATCTGCAC 
CGTAATTAAA AATTTGCTTG ATACGAAAAG CCAGAGTGAA AATTAG
|
Protein sequence | MNNLFIPMYK ILVRVLLIIV VASGFLFNSL DLQGTKVEAS TSGATTLSIE PVTKNVSAGD TFTLNVLVAP ATAIAGAQFN LSYDPAVLQI NSVTEGGLLK QNGNTSFFLP GDIDNTEGLL KNVAGAITTS GGEVSEEGVL ATVTFTAKAS GTSTLTLSNV IAGSKVGQSV PVQVIGGSVT VQGGTSSESV SGVSLDQTSC SLTVGETGQL TATVQPANAS NKNVTWTSDN EAVATVDATG KVTAVSAGTA NITVTTADGG FTATCAVTVQ GGTSSESVSG VSLDKTSCSL TVGETGQLTA TVQPANASNK DVIWSSDNEA VATVDATGKV TAVSAGTANI TVTTADGGFT ATCAVTITTG SATIVGIDPA TKTVSAGDTF DLDVLITPAT AIAGAQFNLS YDPAVLQVNS VTEGGLLKQN GNTSFFLTGV IDNNSGLLNN VAGAITTSGG EVSGAGSLAV ISFTAKATGT STLTLSNVIA ANKAAQAVPV QVNGGSVTVQ GGTSSESVSG VSLDKTSYSL TVGETGQLTA TVAPANASNK NVTWTSDNEA VATVDATGKV TAVSAGTADI TVTTADGGFT ATCAVTIATG SVTIVGIDPA TKTVSAGDTF NLDVLITPAT AIAGAQFNLS YDPAVLQVNS VTEGGLLKQN GNTSFFLTGV IDNNSGLLNN VAGAITTPGG EVSGEGSLAV ISFTAKATGT STLALSNVIA ANKAAQAVPV QVNGGSVTVQ GGTSSEPVSG VSLDQTSYSL TVGETGQLTA TVAPANASNK NVTWTSDNEA VATVDATGKV TAVSAGTANI TVTTADGGLT ATCAVTVQGG TSSESVSGVS LDQTSCSLTV GETGQLTATV APANASNKNV TWTSDNEAVA TVDATGKVTA VSAGTANITV TTADGGFTAT CAVTIAIGSV TIVGIDPATK TVSAGDTFDL DVLITPATAI AGAQFNLSYD PAVLQVNSVT EGGLLKQNGN TSFFLTGVID NNSGLLNNVA GAITTPGGEV SGEGSLAVIS FTAKATGTST LALSNVIAAN KAAQAVPVQV NGGSVTVQGG TSSEPVSGVS LDQTSYSLAV GETGQLTATV APANASNKNV TWTSDNEAVA TVDATGKVTA VSAGTANITV TTADGGLTAT CAVTVQGGTS SESVSGVSLD QASCSLTVGE TGQLTATVAP ANASNKNVTW TSDNEAVATV DATGKVTAVS AGTANITVTT ADGGFTATCA VTVQEGTSSK PVTGVSLDKT SDSLTVGGTA QLTAAVQPAD ASNKDVTWSS DNEAVATVDA TGKVTAVSVG TANITVTTAD GGFTDTCVIT VPQVVNLDPD SNSIAVTNNP MEVNVPKDVA NPQITLTPSS GGSFTMPQVK VKADTSLGTV SVGISAGTQI TGPADWNGII ALPQVLLNNA VTVTPTSGKT ATVEAVVEIG ASNDVALTFD HAVRMLIPGM AGKQAGYVRG GVFQKIETLC EQDSQEWADS NLPAEGAGKI DVNGDLVIWT KHFTKFVTYT ETAVGGGGGG GGGGSSTTTP SVQTSEAAGI TESSATLNGS ITSSGGSAVT SYGFVWGTDQ NNLDKKVQVG TDNHSGSFQT NLSGLTAGIT YYFKAYAVNS KGTKQGAIKQ FTAAAAAQQP SVEVTSPNGT PSNVKSFSDV ANSHWAYNAI SQLSKNGLVN GYPDGTFRPD REMSRAEFVT VLSKALQLPD YKPAASTFGD VAASDWYAGA VETVVHAGII SGYGNGSFGP NDSVTREQAS VILVKALGKN NEAAAKMLDS TGFTDDTAIS SWARGFVVAA EEQNLLKGYP DSTFRPQQNM 
TRAEICTVIK NLLDTKSQSE N
|
| |
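The gene and protein sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that grouping and translate the nucleotide sequence. It is an illustration under stated assumptions rather than the method used by the database: the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and the codon assignments are those of the standard genetic code, which translation table 11 shares for amino acids (the two tables differ in permitted start codons). Only the first three blocks of the gene sequence are used as input, so the GC fraction printed here is for that excerpt and is not expected to reproduce the 54% reported for the whole gene.

```python
# Illustrative helpers; none of these names come from the record itself.
BASES = "TCAG"
# Amino acid assignments of the standard genetic code (also used by table 11),
# listed in TCAG order for the first, second and third codon positions.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
excerpt = "ATGAATAATC TATTCATACC AATGTATAAA"
print(translate(excerpt))    # MNNLFIPMYK, the first ten residues of the protein
print(gc_fraction(excerpt))  # GC fraction of the excerpt only
```

Passing the complete gene sequence to the same functions, with the wrapped lines joined, would give the full-length translation and the whole-gene GC fraction for comparison with the table values.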