Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2522 |
Symbol | |
ID | 4809278 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 2990668 |
End bp | 2992290 |
Gene Length | 1623 bp |
Protein Length | 540 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 640107938 |
Product | membrane associated protein |
Protein accession | YP_001038917 |
Protein GI | 125975007 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0879988 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGGGCAA TGGTAGTTGA TATGAATGAC AAATACGCTG TTGTTGTTAA TAAAGAGGGT CAATACATTA AGATTAAGAG GAAAGCAGAG CATAGATTGG GCTATCAAGT TGAATTGCCG GACAGAGTGA TTGGATTTGA AAGAAGAACG TTATTGAAAG TAGTATCTGT GGCAGCAGCT CTGTTGATTG TTTCAAGTAT CTCCTTTGCC GTATACAGCT ATAATTTGCC TTACAGCTAC GTGAATGTTG ACATAAATCC CAGTTTGGAG ATAATCCTTA ATATGTACAA CCGGATTATT GATGTAAAAG CGTTAAATTC TGAAGGCGAG ATGCTGATTG AGGATTCTTA TAAGAATTCC CGGTTGGATG AAGGTGTGGA AAAAATTATT GACAGTGCCG TGGCACAAGG TTTCCTTAAA AATGATGAAG AAAATACCAT CATGCTTACC GTTGCAGGCA AGAATTCCAG AAAAGTTCTT GAAATAAAGG AAGAAGTGGA GAGTACAGCA AACAAGGTTT TAAATGATGA TAACGTGGTT TCCGAGGTGA TTGTTGAGAA CATAGTGCTG GAAAGACGCG AAGAAGCCAG AGAACTTGGT ATAGCTCCGG GCAAATTGCT TTTGATTGAA AAGCTTAAAG AAGTCGATCC CAAGGCAACT ACCGAAGAGT ACAAGGACAA ACCTGTGAAT GAGATTGTAA AAACCATCAG GGACATAAAG AAAGTTCCAA ATGAGAACAA CCGAAAGGAT GACGATAAAA AGGTAAACAA TGAGCCGAAT AAGCCATTAC CTGACAGAAA AGCCGATGTG GAAACAAGTG CCGGGGTAAA AGAAAATACC GCCGGTCCGG ATGCAGGCAT CAAACCGGTG AATAAAACCG ATAATGCTAA ACCCAATGTT GGTACCGACA TAAATAACAA AGAGAATAAA ACAGTCAGCA ATGCGAAGAT TGACAGCGGC ATTGACAAGG GCAACAAAGA CAGTAAACCC AACAGTAATA CTAAAATTAA TAACGACGTC AAAAAGGACA ACAAAGATAA TAAAACCAAC AGTGATGCCA AAACCTTCAA CGATGTCAGC AAAGACAACA AAAATGATAA AGCTGACGGC AATGCTAAAA TCAACAATAA CATCAACAGA GACAATAAAA TTACTCCGAT TAATCCGGAT AATAAATTTA GCAGCGGCGG CAGCAAAGAC GACAAAGATA ACAAGCATGT TGATAGCAAA GATAAAATGA ATAATGAAGA CAACAAAAAC ATTAACAATG GCAGCTGCCC CCAATACAAT CCATATTGGA ACCCTTACTG GAATCCCTAT TGGAATCCAT ATTGGGGAAA TCCGAAAGAA AAAGAGGATA TGACAAAGCA AAATGATGAA TGGTTTAAAA AGATGCAGGA AGAACAAAAG AAACAGTACG ATGAATGGCT GAAAAAGATG CAGGAGGAGC AAAAAAAGCA GCATGATGAG TGGGTTAAAA AGATGGAAGA AATGAAAAAT ACGGAAAAGA TGAAAAATCC ATACCAGGAA AATAAAATTG AAAAACCCAA AGAGGCAGAA AAGGAGAATA AACCGGACAG ACCTCCGGAG CCGGGAAAAG AAATTTTGAA GAAAAGATGC TAA
|
Protein sequence | MRAMVVDMND KYAVVVNKEG QYIKIKRKAE HRLGYQVELP DRVIGFERRT LLKVVSVAAA LLIVSSISFA VYSYNLPYSY VNVDINPSLE IILNMYNRII DVKALNSEGE MLIEDSYKNS RLDEGVEKII DSAVAQGFLK NDEENTIMLT VAGKNSRKVL EIKEEVESTA NKVLNDDNVV SEVIVENIVL ERREEARELG IAPGKLLLIE KLKEVDPKAT TEEYKDKPVN EIVKTIRDIK KVPNENNRKD DDKKVNNEPN KPLPDRKADV ETSAGVKENT AGPDAGIKPV NKTDNAKPNV GTDINNKENK TVSNAKIDSG IDKGNKDSKP NSNTKINNDV KKDNKDNKTN SDAKTFNDVS KDNKNDKADG NAKINNNINR DNKITPINPD NKFSSGGSKD DKDNKHVDSK DKMNNEDNKN INNGSCPQYN PYWNPYWNPY WNPYWGNPKE KEDMTKQNDE WFKKMQEEQK KQYDEWLKKM QEEQKKQHDE WVKKMEEMKN TEKMKNPYQE NKIEKPKEAE KENKPDRPPE PGKEILKKRC
|
| |