Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dtox_1975 |
Symbol | |
ID | 8428957 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfotomaculum acetoxidans DSM 771 |
Kingdom | Bacteria |
Replicon accession | NC_013216 |
Strand | + |
Start bp | 2115722 |
End bp | 2118625 |
Gene Length | 2904 bp |
Protein Length | 967 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 645034303 |
Product | hypothetical protein |
Protein accession | YP_003191434 |
Protein GI | 258515212 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.000301767 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0000479173 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTTGAAAT ATCACCAGAG GGGACTATTT TTTGCCCTCT TTTGTGCTTT GCTGTTTGTT TTTGCGACCG GCGGCACTGC GTTAGCCACG GAAGTAGAGG AAGTAAATGG TTTGTATGTA ATTGCCACAT TGCCGGGAAG CGACACACCC CTTCCGGTTA AGTACTATGG CGATATAGCG GGTAGTGTTG CTGCTTCTCC TTGCTATATC ACGGTGCCGG CGGGGACGAC CCAGATTACC ATAAAGAAAG ATCCAAATTT AAATACAGAT TACACTTTTA CCTCTTTAAG GGTGTATAAG CCGCCCTATA AGGCTGCAGA TAAATATCCC CTTGATGCCG GGGCGGAATT TTCCGTAACG GTGCCCGCCA CGGCCACCCG GCAGAATCCT TATCGAATAA ACTCTGCTAT TAAAAACTCT AACGGTGCTG ATACTAGTAA TACTATATTT TTCTACTGTG TGGAGGAAAA CGGTCCCAAT CCCTTTGTAA CACCGGATCG CCAGGAAGTG GCGGAGGGAA GCACCGTATC TCTGAATGTG TACGGCACCG GGAATCCGGC CCAACCGGTG GAATGGTATT ACGGCTATTA TGATAGTAAC TCGTCGACAA GAATTTACGA GAAACTGGAA GGGGAAACCG GCGCCGACCT GAATATCACC GCTGTCCAGG ATTCTTTGAA CCTGGGATCT TTTAACGAGG AAACCAGCCA GTATTCTCTT CGCTTTTACA CCAAAATCGG CACATATATC TCCAATGAAG TTTATGTGAC AGTGAACGGA ACCCCGGTCT CCAAACCGGA TATAAAATCC CTGGATGCCA GTCACTGGGG TAGCTATGCC GGGGCGAAGC TCTGGCTGAA TGTGGGAGCC GCCTTTAACG ATACGGTGGC CGCGGAGACT CCGGTGAAAT TCTATCTGAC CGCCGCGGAA AACGGCACGC CGGATACGGC TATTGCCGGC ACCCTGGTGG AAAGCACTGT GGCCGCGGTA CCCGGCGCCG CTGCCGAGGA TTATAAGGAT CTGGGTTTTC TGGTGCAGGA TCTGCCCGCC GGCAGCTACT GGCTCGCCAT CCAGCTGGGC ACGGACAATC CTTCCTACAG CTACAGAGCC TTTACAGTTT ATCCGGGCGT GAAGGAATAT GTCAAAGCTT CTCTGGACAA AATGATCTCC TGGTATCAAA CGAAGATATT TGTAAAAGAT GGAATTTATG TGGGCCTTTC CGAGCGTAGA GACGGTACGG ACTGGGATGC CTGGATCTTT CCCAATTACG GCTATGCCGT CACCGATCCC CTGCTGGCTG GCGCCGACGG CAAGACCTAT TTGGACGGCC TGGAAGCGGC TTTAAAAAGC CGGGACGCCG GCGGTACACT CAATTCGCCC AAAGACGATT TCCGTTATGT GGCTGCCCTT TGTGCCGTGG GGGCGGACCC CCGGAATTTC AACGGCCGGA ATCTGGTGGC GGAGCTGATC GGCCACGCCT ATAACGCGGA CGGCACCCTG AAGTTGGGGA AAGACGGCGT GCTGGATCTA AAAATTGATA TCCTGACCGT CTCCTATCTC CTCCTGGGGG CGGAAATTGC CGGGGCCACG GAGGCGGAAG GCTATACCGA CGCCCTGAAA AAAGCCGGGA TCAAGGCCAT TCTGCCCACC GTGGAAATGT CTGTCGATGA GCGCAGTGAT ATCAGTATGA TCTCTTCCAG CGACTGGCTG GCCATGCAGA GCTATCCCCT ATATTTCCTG CAGGATGATC TGGAATACGG GGATCGCATT AGAGACGCCG TCGGCAAAAT GGCGGCAATG GTATCCACTC ATTTATACGC TAACGGCGGC GTAACCATGA ACTGGCCCGG GGTGAGCAGC CCGGGATCCT TTGATACCCC TCTGCCTGCC ACGGCTTACG CTGTCAATCC TAACAGCATG GCCGTGATGC TCAATGCCCT GGTCCTCTTC GGCGCCACGG CGGAAGACCT GGGCAGCAAT GCCTGGCAGG AAGATTGGGG CACTTATATG ACCGCCCTTC TGAGCTTGCA GATGGATGAC GGCTCCGTGG GCTTCAACGG TGGCAGCAAT GACATGGCCA CCTACCAGAC ACTGGGGGCC CTGGTGGAAC TGCATACGGG CAGGAGCTGC TTTGTTAACG CCCGGGACAC CTATCTGAGC AAGTATCCCG ATTATGCCGC CCAGATCACC GCTCCCTTTA TTTCCGACGG AAGTGGGGCG CGCTTGTCCG CCGGGGAAGG GAAGGTCACC TTTACCTCCA ATGAAGGAGG ACAAGCCTAT TACGCCGTGG TGGACCGCGG GGCCGCCGCT CCCGTCATCG ACACCTCCGG AGCGGGAACA GATTGTGTGT CCGGGGTCAA TACCATAGTG ATCGGGGATT TGGTGGATAG CAGCGCCAAA GATGTCTACC TGCTGGTAAA GGACGGAGAC GGCCACAGCA GCAAAACCTT GCAGATCAGC ATGGCGGATT CCTCCGCGGA CACCGGAGCA CCGGTGATCA CCTTAAACGG TGCCTCGGCA ATCACCTTAA ATCTCGGGGA AAGCTTCACC GATCCGGGAG CGACGGCCAC AGACAATGTG GACGAAGACC TGACGGACCG GATCACCGTA GGCGGAGACA CGGTGGATGT CAATACGCCG GGAACCTACA CCATCACCTA TAATGTCAGC GACGCCGCCG GCAACTCGGC AGCACAGGTA ACCAGAACCG TCACCGTGCA GGGAACAGAA ACCGGAGACG GGGATGTAAA CCGGGACGGC AAGGTGAATG TGATGGATAT GATTCTGGTG GGCCAGCATT TCAATGAAAG CGGAGCGGCC GGCTGGATCA GCGCGGATAC GAATAAAGAC GGGAAGATCG ATGTATCGGA TTTGATCTTC ATCGGACAGC ACTGGAAAGA TTAA
|
Protein sequence | MLKYHQRGLF FALFCALLFV FATGGTALAT EVEEVNGLYV IATLPGSDTP LPVKYYGDIA GSVAASPCYI TVPAGTTQIT IKKDPNLNTD YTFTSLRVYK PPYKAADKYP LDAGAEFSVT VPATATRQNP YRINSAIKNS NGADTSNTIF FYCVEENGPN PFVTPDRQEV AEGSTVSLNV YGTGNPAQPV EWYYGYYDSN SSTRIYEKLE GETGADLNIT AVQDSLNLGS FNEETSQYSL RFYTKIGTYI SNEVYVTVNG TPVSKPDIKS LDASHWGSYA GAKLWLNVGA AFNDTVAAET PVKFYLTAAE NGTPDTAIAG TLVESTVAAV PGAAAEDYKD LGFLVQDLPA GSYWLAIQLG TDNPSYSYRA FTVYPGVKEY VKASLDKMIS WYQTKIFVKD GIYVGLSERR DGTDWDAWIF PNYGYAVTDP LLAGADGKTY LDGLEAALKS RDAGGTLNSP KDDFRYVAAL CAVGADPRNF NGRNLVAELI GHAYNADGTL KLGKDGVLDL KIDILTVSYL LLGAEIAGAT EAEGYTDALK KAGIKAILPT VEMSVDERSD ISMISSSDWL AMQSYPLYFL QDDLEYGDRI RDAVGKMAAM VSTHLYANGG VTMNWPGVSS PGSFDTPLPA TAYAVNPNSM AVMLNALVLF GATAEDLGSN AWQEDWGTYM TALLSLQMDD GSVGFNGGSN DMATYQTLGA LVELHTGRSC FVNARDTYLS KYPDYAAQIT APFISDGSGA RLSAGEGKVT FTSNEGGQAY YAVVDRGAAA PVIDTSGAGT DCVSGVNTIV IGDLVDSSAK DVYLLVKDGD GHSSKTLQIS MADSSADTGA PVITLNGASA ITLNLGESFT DPGATATDNV DEDLTDRITV GGDTVDVNTP GTYTITYNVS DAAGNSAAQV TRTVTVQGTE TGDGDVNRDG KVNVMDMILV GQHFNESGAA GWISADTNKD GKIDVSDLIF IGQHWKD
|
| |