Gene Cthe_2488 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2488 
Symbol 
ID4809426 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2957545 
End bp2959380 
Gene Length1836 bp 
Protein Length611 aa 
Translation table11 
GC content41% 
IMG OID640107903 
Productphage tape measure protein 
Protein accessionYP_001038883 
Protein GI125974973 
COG category 
COG ID 
TIGRFAM ID[TIGR02675] tape measure domain 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAACTT TAAAAGCAGT TATGGCTTTA TTGACTGGAG GTTATACATC AGGAATCAAT 
AAAGTCATTA AAAATACAGA TAAAGCAACA GATAAAATTT TAAAAGCTAG CGGTGCCACA
GATGAATTTA ATAAAAAATT AGAAGTCACT GGCGCAAGTG CTAATACTGC AAGTGGTGGA
TTGGGGAAAT TACTTAAAAC TTTTATAAGT TTAGCAGCGA TAAAAAAAGG AATAGATATT
ACAGACGAAT ATAGTAATAT AGCTGCTAGA CTCGCACTTA TTAATGACGG CTTGCAAACA
CAAGAAGAAT TGCAAAATAA AATCTTTGCA GCTGCTAATC GGTCTCGCGG TGTATACTCA
GATATGGCCA GTGCAGTGGC CAAAATGGGG CTGCTAGCCA AGGACGCTTT TACCTCCAAT
GATGAACTAA TTGCCTTTAC AGAGCTTGTA CAAAAATCAT TTAAAATTAG CGGAGCTGAC
CCATCTGAAC AGGCAGGAGC AATGAGACAA TTAGCTCAAG CGATGGCTTC TGGTAGGCTT
CAAGGTGATG AATTAGTATC AATAATGGAA AATGCTCCAA TGATATATGA GGCAATAGCA
AAATATATGG GAAAGACAAA AGGAGAACTT AAAAAATTAT CTTCTGAAGG AGCTATAACG
GCCGACATAA TCAAAAATGC CGTATTTGCC GCAGCGAAAA ACATCAACAC CAAGTTTGCA
GAGATGCCAA TGACTTTCGG AGACATATGG AACAGGATTA AGAATGGTGC ACTTAAGGCC
TTTGATAAAG TTATTGTAAA GGTAAATCAG CTTATTAATG CTGACAAGTT CCAGCGATTT
GTAGACAGAA TGATTACTGG TTTTAGTCTT GCAGCATCTG CGGCAAGCTG GTTAATCGAT
GCTATAATTA GAGGTTGGGA TACGATAGGG CCAATACTTG CAGTTATTGC TGGCATATGG
CTTGTTTCTA TAATTGGAAA ACTGTGGGCA ATGATACCAC CACTGATTGC GCAAGCAGCA
GCATGGTTAA GTGTATATTG GCCTATACTA CTGGTAATTG CTATTATAGG AATAGCAATA
TCTGCAGCAA GACAGCTGGG AGCAACATGG GATGAGATTA TAGGATTTAT TGGAGGGCTA
ATCGGTGTTT TTGCCACAAC TTTCTATAAC TATTTCGTCA TGATCTGGAA TCACATAGCC
GCTTTCGTGA ATTTCTTCGG CAACGTATTC AAAAACCCAG TAGCTGCTGT ACAAGCGCTG
TTTTTTGATC TAGCATCTAA CTTGCTTGGG TATATCGAAA AAGTGGCCCG GGGAATTGAA
GATTTGCTGA ACAAGATCCC GGGCGTGAAC GTAAATATCG CCGGAGCCAT CACAAAACTG
AGAGACAAAC TAAAAGCGGC ATCAACGCAG ATAAAAACCG AAGCCGACCT GAAAACCTAT
GTTCAATCCA AAGAATTCAT GGATTTCTCT GAAGGTTGGA CGAAAGGCAG CACCATGGGG
AAAAATCTTG TAGACAAGGT AAGCAACGCA TTGTCAGGGC TGACTGATAT AGGCAAAAGT
TTTGACATGG GGCAATTCGG TACAAGTCAA AACCCGCTAT ATGTCACATC TAACGATAAG
CTTAAGGTGG ACATGTCGGA TGAAGACTTG AAGTATTTGC GAGATATCGC AGAAAGAGAA
TACATTGCCA AATTCAGCAC CGCAACGCTT GCACCTAACA TCAGTATATC CTTTGGAGAT
GTACACGAAA CAGCGGATGC CAATAAGATA GCGGGAAGAA TTAGAAAAAT ACTCCAGGAA
GAAATCGCTA TGGCGGCAGA GGGGGCATAT GCATGA
 
Protein sequence
MATLKAVMAL LTGGYTSGIN KVIKNTDKAT DKILKASGAT DEFNKKLEVT GASANTASGG 
LGKLLKTFIS LAAIKKGIDI TDEYSNIAAR LALINDGLQT QEELQNKIFA AANRSRGVYS
DMASAVAKMG LLAKDAFTSN DELIAFTELV QKSFKISGAD PSEQAGAMRQ LAQAMASGRL
QGDELVSIME NAPMIYEAIA KYMGKTKGEL KKLSSEGAIT ADIIKNAVFA AAKNINTKFA
EMPMTFGDIW NRIKNGALKA FDKVIVKVNQ LINADKFQRF VDRMITGFSL AASAASWLID
AIIRGWDTIG PILAVIAGIW LVSIIGKLWA MIPPLIAQAA AWLSVYWPIL LVIAIIGIAI
SAARQLGATW DEIIGFIGGL IGVFATTFYN YFVMIWNHIA AFVNFFGNVF KNPVAAVQAL
FFDLASNLLG YIEKVARGIE DLLNKIPGVN VNIAGAITKL RDKLKAASTQ IKTEADLKTY
VQSKEFMDFS EGWTKGSTMG KNLVDKVSNA LSGLTDIGKS FDMGQFGTSQ NPLYVTSNDK
LKVDMSDEDL KYLRDIAERE YIAKFSTATL APNISISFGD VHETADANKI AGRIRKILQE
EIAMAAEGAY A