Gene Cthe_1481 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1481 
Symbol 
ID4810631 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1800208 
End bp1802520 
Gene Length2313 bp 
Protein Length770 aa 
Translation table11 
GC content42% 
IMG OID640106902 
Productmembrane protein-like protein 
Protein accessionYP_001037903 
Protein GI125973993 
COG category[S] Function unknown 
COG ID[COG1511] Predicted membrane protein 
TIGRFAM ID[TIGR03057] X-X-X-Leu-X-X-Gly heptad repeats 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0291099 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAGTA AAATGATAAA AATCATTTCT CTTTGCCTGT GTATAGCCAT CCTTGCCAGC 
GGAGCGGCTT ACGCCCTCGC ATCAGGTAAA GACAAAGCTG AAAATGTTAA AACAGCGGAA
AACATTACGG AAAACACTGC AGAAAAACAA CATGAAGAAG CTGCGGCAAA AAGCGATACT
TTCAAAGATG AAACAGTATA TGTTCTTGCA AACCCTGACG GTACGGTGCA AAAAATTATT
GTAAGCGACT GGATAAAAAA CACGCTTTCA AGTGAAAAAA TCACTGATGT AACCGAACTT
GAAAATACAG AAAATATCAA AGGTGACGAA AGTTATACCC TGGGAGGTAA CAACACCCGT
GTATGGGATG CGCAGGGAAA TGACATCTAT TATCAGGGCA CAATTGAAAA AGAACTGCCG
GTGGATCTTT CGGTTTCTTA CAAACTTGAC GGAAAAACAG TATCCGCAGA CGAGCTCATA
GGCAAAAGCG GCAAAGTAAC CATCCGCTTC GACTACAAAA ACAAGCAATA TGAAACCGTA
AAAATAAACG GCAAAGATGA AAAAATCTAC GTTCCCTTTG TAATGCTGAC GGGAGTGCTC
CTTGACAACG ACAACTTTAC CAATGTTCAG GTTTCAAACG GAAAAATCAT TAACGATGGA
AACAAAACGG CAGTTTTAGG GTTTGCTTTA CCGGGGCTTC AGGAAAATCT TGCAATAGAC
AAGGAAAAGT TTGAAATACC GTCTTATGTG GAGATAACAG CCGATACCAC CGATTTTTCT
CTGGGAATTA CCGTTACTGT AGCAACCAAC TCATTATTTA ACAATATAGA TGTGGAAAAA
ATTGATTCGA TATCTGATTT GACCGATTCA ATGAATGAAC TTGATGATGC CATGACAAAA
CTTCTTGACG GTTCTTCTTC GCTGTACAAC GGCATCTGCA CGCTTCTTGA CAAGTCAAAG
GAACTTGTTG AAGGAATTAA TCAACTTGCA GAGGGTGCCG GAAAGCTGAA AGAAGGAGCA
TATTCCCTTG ACGAGGGAGC GAAAAAACTG TATACCGGAG CGGCAGCGTT GTCACAGGGC
CTTGATACGC TTTCGGCAAA CAATGATACA TTAAACGACG GTGCAAAAAA AGTTTTTGAA
ACTATCCTTT CCACCGCCGA GACACAATTA AAAGCATCCG GGCTTACAGT TCCTGAATTG
ACCATCGAAA ATTACGCTTC AGTACTCAAT GAAATTATTG AATCCCTTGA CAGTACAAAA
GTATATAATC AGGCTCTCGA ACAGGTTACA GCAGCCGTTG AAGAAAAACG GGACTATATC
AAATCACAGG TCGTTGAAGC CGTGCGTTCG GAAGTCGAAG CCAATGTCAC CGCCGCAGTT
ACGGAACAGG TTAAAGCAGA GGTTACAAAA GCAGTAAAAG AACAAGTGTC GGAAAAGGTC
ACTGCAAATG TTCGTGAGAA CGTTGAAGAA CAAGTAATTC TTGCTGCAAC AGGTATGAAC
AAGGCAAGCT ATTATGCCGC AGTTTCAGCC GGACGCGTGG ATGCGGCAAC ACAAAAGGCC
GTCAAATCCG CCATTGACAA CAAAATGGCA AGTAAAGAAA TTACCGATTT AATCTCATCA
AATATTGACA AACAAATGCA GAGCGAGCAA GTTTCCGCAA TGATTTCGCA AAAAGTCGAT
GAACAGATGC AGACAGAAGA AATTAAGAAT ACCATTAAGA AAAATGCTGA GCTAAAAATG
GCAGAAAAGG ATATTCAGCA ACTGATTGAA CAAAATACAG AAGCACAGAT ACAAAAAGTC
ATATCCGAAA ACATGGCAAG TGAAGAAGTG CAATCAAAAC TTGCTGCAGC ATCGGAAGGG
GCAAAATCAG TTATTGCGCT CAAAACTTCC CTTGACGACT ACAATTCATT CTATCTTGGA
CTTATGAAGT ATACTGCAGG TGTCGCCCAG GCGGCAAGCG GTGCTTCTGA TTTGAAAAAC
GGAGCAAATG AACTGCAGAA CGGAACGGGT ACTTTATATA AGGGTGTTTG CTCCCTTTAT
GACGGAATAC TGACAATGAA AAACGGTCTG CCTGCACTCG TAGACGGTAT CACACAGCTT
AAAGACGGAG CAATGCAGCT TTCAGACGGG CTTTCAAGGT TCTATGAAGA AGGTATTCAG
AAATTAATCG ATACGGTCGA TGATGCCGAA AATCTTATCG AACGGCTGAA AGCTACCGTA
ACCGTATCAA AGAATTACAA GTCTTTTGCC GGAATAAGCG AAAACGCTGA CGGTAATGTC
AAGTTTATTT ACCGCACAGA CGAGATTAAA TAG
 
Protein sequence
MKSKMIKIIS LCLCIAILAS GAAYALASGK DKAENVKTAE NITENTAEKQ HEEAAAKSDT 
FKDETVYVLA NPDGTVQKII VSDWIKNTLS SEKITDVTEL ENTENIKGDE SYTLGGNNTR
VWDAQGNDIY YQGTIEKELP VDLSVSYKLD GKTVSADELI GKSGKVTIRF DYKNKQYETV
KINGKDEKIY VPFVMLTGVL LDNDNFTNVQ VSNGKIINDG NKTAVLGFAL PGLQENLAID
KEKFEIPSYV EITADTTDFS LGITVTVATN SLFNNIDVEK IDSISDLTDS MNELDDAMTK
LLDGSSSLYN GICTLLDKSK ELVEGINQLA EGAGKLKEGA YSLDEGAKKL YTGAAALSQG
LDTLSANNDT LNDGAKKVFE TILSTAETQL KASGLTVPEL TIENYASVLN EIIESLDSTK
VYNQALEQVT AAVEEKRDYI KSQVVEAVRS EVEANVTAAV TEQVKAEVTK AVKEQVSEKV
TANVRENVEE QVILAATGMN KASYYAAVSA GRVDAATQKA VKSAIDNKMA SKEITDLISS
NIDKQMQSEQ VSAMISQKVD EQMQTEEIKN TIKKNAELKM AEKDIQQLIE QNTEAQIQKV
ISENMASEEV QSKLAAASEG AKSVIALKTS LDDYNSFYLG LMKYTAGVAQ AASGASDLKN
GANELQNGTG TLYKGVCSLY DGILTMKNGL PALVDGITQL KDGAMQLSDG LSRFYEEGIQ
KLIDTVDDAE NLIERLKATV TVSKNYKSFA GISENADGNV KFIYRTDEIK