Gene Information |
Locus tag | Cthe_1799 |
Symbol | |
ID | 4809783 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 2124309 |
End bp | 2126225 |
Gene Length | 1917 bp |
Protein Length | 638 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 640107213 |
Product | ABC transporter related protein |
Protein accession | YP_001038213 |
Protein GI | 125974303 |
COG category | [R] General function prediction only |
COG ID | [COG0488] ATPase components of ABC transporters with duplicated ATPase domains |
TIGRFAM ID | |
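The coordinate and length fields above are related by simple arithmetic, which a short check can make explicit. This is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS is unspliced and includes its stop codon. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


# Values copied from the Cthe_1799 record above.
length = gene_length_bp(2124309, 2126225)
print(length)                     # 1917
print(protein_length_aa(length))  # 638
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.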
| |
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATAGTTT TAGGTTGTAA CAATATAAGT CTGTCCTTTG GAGTCACAAC AATACTGAAA GATATTTCTT TCAGTATCAA TGATACTGAC AAAGTCGGCG TTGTGGGAGT AAACGGAGCG GGCAAATCCA CACTTTTTAA AATAATAGCC GGCGAATACA TTCCGGACAG CGGAGAAATT TATACAGCCA AAAACTCAAA ACTCGGTTAC CTTGCACAAA ATTCCGGTCT GGACTCGTCC AACACAATAC TGGAAGAAGT CCTGGCGGTG TTTTCCCACT TCACTGAAAT GGAATCCCGT ATAAAAGATT TGGAAAAATC CATGAGCACA GAAAAAGATG AAAATCAATT AAATTCAATC ATGAAGGAGT ATTCACGATT GACCGATGAA TATGCCCGTT TGGGAGGGTT TGAATATCAA AGCCGCGCAA AGGGTGTTTT GAAAGGTTTG GGATTTGAGG AAGACCAATT CTCATTGAAT GTCATGAATT TGAGCGGGGG GCAGAAAACA AGGCTTGCCC TGGCAAGGCT TCTTCTTACC GAACCGGATA TTCTTCTTCT GGACGAGCCT ACAAACCACC TTGACATAAA GGCCGTGGAA TGGCTGGAAG AGTTTTTGTC AAATTACAAA AAGTGTTTGA TGGTTATATC CCATGACAGA TATTTTCTTG ACAAAATCAC CAACAAAACA CTGGAAATTG AAAACTGTGA ATGCAAGCTC TACAACGGCA ATTATTCAAG ATATTTAAAC CAAAAAGCTG TGGACCGGGA AATTCAGCAA AGGCATTATG AACAGCAGCA GAAGGAAATT GCACGAATGG AAGCATTTAT AGAGCAGCAG CGCAGATGGA ACAGGGAAAG AAATATCATT GCTGCGGAAA GCAGACTGAA AGCCATAGAA CGAATGGAAA AAATTGAAGC TCCAAAAAAC TTGCCGGAAA AGATAAGAAT AAAATTCAAA AGCGGTTTTG CCAGCGGAAA TGACGTTCTC TTTGTGGAAG GTCTGGGAAA ATCATATCCC GGCAAGCCAC TTTTTAAAAA CGTAAAATTC AACATCAGAA AAAAAGAAAG AGTATTCATT CTAGGTCCCA ACGGATGCGG AAAGTCCACC CTTCTGAAAA TACTTACCGG CAAAATTGAC GACTATGAAG GAAGCTTCCG GTACGGGCAC AATGTAAATC CAGGTTATTA CGACCAGGAG CAGGAAGGGT TAAACCCGAA CAATACAGTA ATTGATGAGG TTTGGAGTGC CGACGAAAAG CTTACCCAGA CAGAAATCAG AAATGTTCTG GCAATGTTTT TGTTTAAAGG GGAAGATGTC TTAAAGCCGG TTTCGACTTT AAGCGGCGGT GAAAAAAGCC GGATTTCCCT TATTAAATTG ATGCTTTCCG AAGCCAACTT CCTTATAATG GACGAGCCCA CAAACCATCT TGATATAAAC TCGAGGGAAG TCCTTGAAAG TGCCCTTGCG GATTATGACG GTACTTTGCT CATTGTATCC CATGACAGAT ACTTTATAGA CAAACTTGCA ACACGTATTA TTGAGCTTGG CGAAACTTCA TGCATTGACT TTAAGGGAAA CTACACCGAG TTTCATGAGT ACAAGAGCAA GTTAAATTCG GGGTCGGACA ACCAATCCAA AAATGTCAAA ATGACGGCAT CAAAAATGGA ACATATCGCA ACAAAGGAAG AAAAGGCAAG AAAAAGAAAA CTTGAAAAAC AGCTTGTCGA GACGGAAAAA GAAATCACCG ACACCGAAGC CCGCATAAAA GAGATTGAAA ATCAAATGAC CAACGAGGAA 
GTTGTCAGTG ATCATGTAAA GCTTGTGGAA CTTCACAACG AATTGAACGA GCTTAATTTA AAACTTGAAC AGCTGTATGA ACTTTGGGAC AACCTTATGT CTGAAAACAG CAGGTAG
|
Protein sequence | MIVLGCNNIS LSFGVTTILK DISFSINDTD KVGVVGVNGA GKSTLFKIIA GEYIPDSGEI YTAKNSKLGY LAQNSGLDSS NTILEEVLAV FSHFTEMESR IKDLEKSMST EKDENQLNSI MKEYSRLTDE YARLGGFEYQ SRAKGVLKGL GFEEDQFSLN VMNLSGGQKT RLALARLLLT EPDILLLDEP TNHLDIKAVE WLEEFLSNYK KCLMVISHDR YFLDKITNKT LEIENCECKL YNGNYSRYLN QKAVDREIQQ RHYEQQQKEI ARMEAFIEQQ RRWNRERNII AAESRLKAIE RMEKIEAPKN LPEKIRIKFK SGFASGNDVL FVEGLGKSYP GKPLFKNVKF NIRKKERVFI LGPNGCGKST LLKILTGKID DYEGSFRYGH NVNPGYYDQE QEGLNPNNTV IDEVWSADEK LTQTEIRNVL AMFLFKGEDV LKPVSTLSGG EKSRISLIKL MLSEANFLIM DEPTNHLDIN SREVLESALA DYDGTLLIVS HDRYFIDKLA TRIIELGETS CIDFKGNYTE FHEYKSKLNS GSDNQSKNVK MTASKMEHIA TKEEKARKRK LEKQLVETEK EITDTEARIK EIENQMTNEE VVSDHVKLVE LHNELNELNL KLEQLYELWD NLMSENSR
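The gene and protein sequences above are linked by translation table 11, and the GC content field can be recomputed from the gene sequence. The following is a minimal sketch under stated assumptions, not the database's own pipeline: the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, the sequence is assumed to contain only A, C, G, T plus the whitespace used for 10-base grouping, and start-codon handling is simplified.

```python
# Amino acids for NCBI translation table 11, listed in TCAG codon order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base groups and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon.

    Simplification: alternative start codons of table 11 (such as GTG or TTG)
    are not converted to M. This gene starts with ATG, so it is unaffected.
    """
    s = clean(seq)
    residues = []
    for pos in range(0, len(s) - 2, 3):
        amino_acid = CODON_TABLE[s[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases and last 21 bases of the gene sequence above.
print(translate("ATGATAGTTT TAGGTTGTAA CAATATAAGT"))  # MIVLGCNNIS
print(translate("ATGTCTGAAAACAGCAGGTAG"))              # MSENSR
```

Passing the full gene sequence to `translate` should reproduce the protein sequence listed in the record, and `gc_fraction` applied to the same string gives the value that the GC content field reports as a rounded percentage.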