Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1500 |
Symbol | |
ID | 4810538 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1822430 |
End bp | 1824322 |
Gene Length | 1893 bp |
Protein Length | 630 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 640106920 |
Product | ABC transporter related protein |
Protein accession | YP_001037921 |
Protein GI | 125974011 |
COG category | [V] Defense mechanisms |
COG ID | [COG1132] ABC-type multidrug transport system, ATPase and permease components |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0000079767 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGCCGG GCAGAGGGAA AATGTCAGGA GGCTTTGGCG GACCCGGCGG GAAAAGACCG CTTGGATTTG GTGGTCCAAG AGGTATGCGC GGTGCTGTGG GCGGTGCCAA ACCGAAAAAT GCATCGGCTA CAATTAACAG GCTGCTGGCT TATATCGGAC GGGACAAGAT TAAAATTCTG TTTGTTTTTG CGTGTGTACT GGGCAGCAGC CTGGCAAGCC TTGCCGGAAG CTATATTTTA CGTCCTGTCA TAAACAATCT GGTTTATTCC GACGGAACTG CAAAAGAAAA AATAAACAAT CTTGTAATCG GTATACTTAC CATGGCATGC ATTTATTTGG CCGGAGTGGT ATGTTCTTAC CTGCAGCAGA GAATCATGAT AGGAGTTTCG CAGAATGCGC TGATTAGAAT CAGGGAAGAG TTGTTTCGCA AGATTCAAAA GCTGCCTTTA AAATATCATG ACACTCACAC TCACGGTGAT ATAATGAGCC GTTTTACCAA TGACCTTGAT GCGGTTGGAG AAATGCTCAA TAACACCATG TCCCAGATTT TTTCAGGTAT TATTACCCTG GTTGGCACTG TTGCTCTTAT GTTTTTTACA AACTGGATAC TTGCAATCAT TATCATCGTG ACGGTGCCTT TGATGGTCTA CGTGGGAGGC ATGATCGGAA AGCAGAGCAG AAAATATTTC ATAGGTCAGC AGCAGGCCCT CGGTGCGGTT AACGGATATA TTGAGGAAAC TGTTACGGGT CAAAAGGTGA TAAAAGTGTT TTGCCATGAA GAAACAGTAG TGGAAGAATT CGAGTTTTTA AGTGACAATT TGCGTGAAAA ACAGAGAAAG GCACAGTTTT TTGGAAGTAT TATGGCACCT GTTATGGGTA ATTTAAGCCA AATAAGCTAT GCGTTGTCTG CCACAATCGG AGGTGTTCTC TGCCTGACGC TGAATTTTGA TATTGGCGGC CTTGCCATAT TTACTAACTA TTCAAGACAT TTTGCAAGGC CGATTAGTGA ATTGTCCATG CAGATGAATA CGATTTACGC GGCATTGGCC GGAGCGGAGA GAGTGTTTGA AGTGATGGAT GAAACACCTG AGAAGGGTGA TTCCCCGGAC GCTATAGAAA TCTGTTCCGA AGCAAACCGG GTGGAAGGAG CACAGCCAAT TAAAGGAGAA GTGGTGTTGA AAAACGTAAC CTTTGGTTAT GTACCGGGAA AAACCGTGCT GAAAAATATT AATGTAACAG CCAAACCCGG CCAGAAAATC GCTTTTGTGG GTTCCACGGG TGCCGGGAAA ACGACCGTTA CGAACCTTAT AAACAGGTTT TATGAGATTG AAGAAGGAGA AATACTGATT GACGGTATCA ATATCAAGAA TATTAAAAAG GACTCATTAA GAAGCAATAT AGCAATAGTG CTTCAGGATA CCCATTTGTT TTCCGGAACG GTCAGAGAGA ATATTCGGTA TGGCCGTCTT GACGCAACCG ATGAAGAAGT CGTTGCAGCG GCCAAGGTGG CCTGCGCCCA TTCCTTTATC GAAAGGCTGC CCCAGGGATA TGATACGGTA TTGGATGGGG ACGGAGCGAA TTTGAGCCAG GGTGAACGGC AACTTCTCAA TATTGCGCGG GCTGCCATAT CAAAAGCTCC CATTCTGATA CTGGACGAAG CCACCAGTTC GGTTGACACG AGGACCGAAA AATACATTGA GCGTGGTATG GACAGGTTGA TGAAAAACCG GACTACTTTT GTAATCGCGC ACAGACTGTC CACCGTGCGA AACGCAGATC TGATTATAGT GCTGGAACAC GGTGAAATTA TAGAACAGGG AACCCATGAA GAACTCCTTG GGATGGGCGG AAGATATTAT CAGCTTTATA CAGGAGCGGT TGAACTGGAC TAG
|
Protein sequence | MKPGRGKMSG GFGGPGGKRP LGFGGPRGMR GAVGGAKPKN ASATINRLLA YIGRDKIKIL FVFACVLGSS LASLAGSYIL RPVINNLVYS DGTAKEKINN LVIGILTMAC IYLAGVVCSY LQQRIMIGVS QNALIRIREE LFRKIQKLPL KYHDTHTHGD IMSRFTNDLD AVGEMLNNTM SQIFSGIITL VGTVALMFFT NWILAIIIIV TVPLMVYVGG MIGKQSRKYF IGQQQALGAV NGYIEETVTG QKVIKVFCHE ETVVEEFEFL SDNLREKQRK AQFFGSIMAP VMGNLSQISY ALSATIGGVL CLTLNFDIGG LAIFTNYSRH FARPISELSM QMNTIYAALA GAERVFEVMD ETPEKGDSPD AIEICSEANR VEGAQPIKGE VVLKNVTFGY VPGKTVLKNI NVTAKPGQKI AFVGSTGAGK TTVTNLINRF YEIEEGEILI DGINIKNIKK DSLRSNIAIV LQDTHLFSGT VRENIRYGRL DATDEEVVAA AKVACAHSFI ERLPQGYDTV LDGDGANLSQ GERQLLNIAR AAISKAPILI LDEATSSVDT RTEKYIERGM DRLMKNRTTF VIAHRLSTVR NADLIIVLEH GEIIEQGTHE ELLGMGGRYY QLYTGAVELD
|
| |