Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dtox_3601 |
Symbol | |
ID | 8430607 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfotomaculum acetoxidans DSM 771 |
Kingdom | Bacteria |
Replicon accession | NC_013216 |
Strand | - |
Start bp | 3798502 |
End bp | 3800514 |
Gene Length | 2013 bp |
Protein Length | 670 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 645035829 |
Product | glycoside hydrolase 15-related |
Protein accession | YP_003192936 |
Protein GI | 258516714 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3387] Glucoamylase and related glycosyl hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCAGAC CGCTGGTGCT TGGCAACGGA AAAATGCTAA TCGCCTATGA CCAAGAATTC AGCTTGAGAG ATTTCTTTTA CCCGCATGTC GGTCAGCATA ATCATATTAT GGGCAGATCC AACCGCTTGG GCTTCTGGGT TGAGGGCATG TTCTCCTGGC TGGATGCTCC ATCATGGCAC AAGTTGGCCA ATTATCGAAA AAACACTCTG GTTATGGAAT CAACTGCTAA AAACGAGGAA ATGGGCTTAT CCATTGTAAA TACAAATGCG GTACACTACC GGGAAAATAT TTACTTAAAG CGCCTGATAG TGCAAAACCT GACTGACAGG GGACGGGAAG TGCGCGTCTT TGTCACGCAC GACTTTTCTA TTAATGAAAA TGAGGTGGGA GATACTGCAT TATATGATGC CCGGTTGAAA ACTGTTATTC ATTATAAGAA AAACTGCTAT ATTTTGATTA ACGGTTTTTT CGGACCATCT TCCTTTAATA TGTACACCAT AGGTGTGAAA CGGGCTTTTG GTGCGGAAGG CACATGGCGT GACGCAGAAG ACGGGGTGTT AAGCAGCAAT CCTATTGCTA TGGGATCTGT AGACAGTACT ATTGGTTTTA AAATCTATCT GGGGGCCAAA GAAGAAGCCA GCATCTACTA CTGGATTGCT GTCGGGCCCA GTTATAATGA GGTAAAAAGG TTAAATGAAT TTGTTTTGCA GCAAACACCC AAGGCTCTGA TAAAACGTGT GGAAAGCTAC TGGAACCACT GGCTGAGCAA TGTCCCGCAG GACTTCCTGG ATCTGCCGGA ATATTTGAGT GATTTATACA ACCGCAGTCT CCTGGTTATA CGCACTCATC TTGATAAAGA CGGTGCCATT TTAGCCGCTG CTGACAGTGA TATCGTCCTG ACCAACAAAG ACCATTACTG CTATCTCTGG CCCAGGGACG GGGCGCTGGT TGCCTGTGCC TTAATTAAAG CGGGCTACCC TCATATAACT AGGAAGTTTT TCCAATTTTG TGCCAATGTA ATAACCGAAG AAGGGTACCT GCATCATAAA TATAATCCGG ATGGAACAGT AGGCTCCAGT TGGCACCCCT GGAATAATCC GGAGAGATTG CCTATACAAG AAGATGAGAC AGCTCTGGTT ATTTATGCTC TTTGGCAGTA CTATGAATTT ACAGGTGATC TTGAATTCAT CAGCGATATA TACAGGAAAC TGATTATACC GGCAGCCGAT TTTATGAACA ATTATATGGA TAACCAGTTG GAATTACCTA AGCCCAGCTA CGATTTATGG GAGGAAAGGC ACGGTATATT CACTTTTACT GCCGCGGCTG TTTATGCCGG ATTAGTTGCT GCGTCAAAAT TTTCCAATAT ATTTTATGCT GCCGAACCGG CAGAAAAATA TCTTGCCTGT GCCAATAAAA TTAAAAAATC TATTAAAACT CATTTATTTG ATCCCGTACT AAAAAGATTT ATCAGGGGTC TGATCTGGGA TCAAGAGGGA GGTTACTACC GCAGGGATAC TACCATGGAA AGCAGTGTAA TGGGACTTTC ATTTTTAGGT GTTTTGCCTC CGGATGACCC TTATATGCAA ACTACCATGG TGGCTCTGGA AGAAGGCCTG TGCAATAAAA CCTGGGTGGG CGGCATGGCC AGGTATACAT ATGACAGGTA TCACAGAAAA ACCACTGATG AAGGAATACC GGGCAACCCC TGGTATATTT GCACTCTCTG GCTGGCTCAG TGGCACATAG CCAAAGCAAA CACCTTAAAG GATATGAAAC CTGCGCTGCC GATTCTTCAC TGGGCCGCAG CTTTTGCTAT GGAAACCGGG ATTATGCCGG AACAGCTTAA TCCTGAGACA GGCGAGCCTC TTTCCGTAGC ACCACTGGTC TGGTCACACT CTACTTTTGT ACAAACCGTG CTGGAGTACA TCAGCAAATA TAAGTCAATG GAAATAGAAG ACATATGTAA AATCAATACT AACAACACAA TTCCAGAACC GCATGCAACT TAA
|
Protein sequence | MTRPLVLGNG KMLIAYDQEF SLRDFFYPHV GQHNHIMGRS NRLGFWVEGM FSWLDAPSWH KLANYRKNTL VMESTAKNEE MGLSIVNTNA VHYRENIYLK RLIVQNLTDR GREVRVFVTH DFSINENEVG DTALYDARLK TVIHYKKNCY ILINGFFGPS SFNMYTIGVK RAFGAEGTWR DAEDGVLSSN PIAMGSVDST IGFKIYLGAK EEASIYYWIA VGPSYNEVKR LNEFVLQQTP KALIKRVESY WNHWLSNVPQ DFLDLPEYLS DLYNRSLLVI RTHLDKDGAI LAAADSDIVL TNKDHYCYLW PRDGALVACA LIKAGYPHIT RKFFQFCANV ITEEGYLHHK YNPDGTVGSS WHPWNNPERL PIQEDETALV IYALWQYYEF TGDLEFISDI YRKLIIPAAD FMNNYMDNQL ELPKPSYDLW EERHGIFTFT AAAVYAGLVA ASKFSNIFYA AEPAEKYLAC ANKIKKSIKT HLFDPVLKRF IRGLIWDQEG GYYRRDTTME SSVMGLSFLG VLPPDDPYMQ TTMVALEEGL CNKTWVGGMA RYTYDRYHRK TTDEGIPGNP WYICTLWLAQ WHIAKANTLK DMKPALPILH WAAAFAMETG IMPEQLNPET GEPLSVAPLV WSHSTFVQTV LEYISKYKSM EIEDICKINT NNTIPEPHAT
|
| |