Gene Cthe_1368 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1368 
Symbol 
ID4809363 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1661035 
End bp1663167 
Gene Length2133 bp 
Protein Length710 aa 
Translation table11 
GC content40% 
IMG OID640106792 
ProductS-layer-like domain-containing protein 
Protein accessionYP_001037793 
Protein GI125973883 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value4.86639e-05 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAAT TCCATAACAA ACAACTTCAA AATATCATCT GTAGTATTAT TGCCATTGTT 
ATGATTTTTG GTGTTATTGC GACGGTGGCA TATGCAGAAG TGGTACAGCC GGATGCGTTT
CAAAAGGTAG TATTGCAGGT GATTAGTCTT ACAGAGCAAC AGAGGATAAA TTTTGTGAAC
AATGTTCTTG ATCAGATTAC TACTGAGAAC TACAAAGACT ATGTGGATGA TGCAAAAGCA
ATTCTTGGTT TGTCAATGTC TGACTCGGAC ATGGAAAAGG CATTGAAGAG TTATGCTTAC
TATCCGGACA CGCACAAAGG CACTTTGAAG GAATTGATAA AAATCTTCGA ATTGCCTTCA
AACCAGTACG ATACGTCAGC TTTTTCAGAC ATTGAGAAGA GGATTAATTT CAACATAACA
GGAGATTCCA ATGACAGCAG AGGTTTCAAA CTGTTTGTTG AAATTTTCAA AGCCATCAGG
AAATTTAATA ATGCACCTGT ATTCTATGAT GGTTCTGATG ATATTTACAA ACTTGATATT
ACAGTTCAAG GAAACAATAC GTTGAAGAAT GAAATTAATG CTCTTATATC TTCCGTAGAG
TCTCTTACCA AGAAAGATAT TAATGATTTT GATTCGTTTA CAGAGTATAT CGAAAATGAA
ATTAATTCGA ATTCTTATGG ACAGATATAT GGATTGAAAG TATTCTTGAA AGATGAACTG
GGATCCAGTA CATATGCCGG TACTTTGCCG GAGCCTTCTG AGATAGATCC TTTACAAAAA
GTATTCCTGG CGATTATCAG TCTCCCTCAG AAGGATAGAA AAGCATTTGT AGAGAATGTG
CTTGTTAAAC TTACACCGGA CAATTATGCC GATTATGTTT ATGAAGCAAA GAAAATCCTG
GGCGTGACAA TTACTGACGG ACAAATGAAA GCTGCGCTCA AAGTATATGC GAGCTATTCG
GAAACTTATC GGGCAAAAAT TGAAGGCATG ATATTGGCAT TTGATTTGTC GGCAATTAAA
ATTGATACTT CCTTATTTGC TGATCTGGCA GCCGATATCA ATTTTGAAGT TACAGGAGAT
GCTAATGATG ACAGGGGAAT AAAATTTGTT GTAAATACAT TGGATTGGCT GTCACAATTT
ACTGGACCGA TAGTATTTGA CGGAACTGAG GTTCCTTACA AGGTGGATTT CAGAATTCAG
GGCAATGCGA CAATGAAACG TCATCTTGAC GGATTGATAA AACTGATTAA GTCTTTGGAA
AAGAGAGGAG TTACGAATTT TGATTCTTTC CTCGTTTTGG CTGAGGATAT AGTTAATGCT
AATGACAATA TTCAGATTTA CAACTTTAAG AAGCTGTTGC GCGATTTGTA TGGAAGAAGT
GTTTATGATG GAGAACTTCC ATATCCGCCA GTGACACCTA CACCTGCACC GACATCCGGT
GGCGGTTCGG GTGGCTCCGG TGGTTCCGGT GGAGGATCAA CTGCTACACC GGCTCCTACA
CCTACACCGA CATCCACTTC CATAGAAGAA CCGACACCGA GTGATGTGCC TGCAGCTCCA
TTTAACGACA TTGCAGGTCA CTGGGCAGAA GAATTCATTG CAAAACTGGC AGCGAGGAAC
GTTGTAAGCG GTTATCCTGA CGGAAGCGTT AAACCTGATA TTGAGATAAC AAGAGCTGAA
ATGGCTGTAA TTGTTGTTAA GTCTGCAGGA CTTGAACCGG TTGAAAATGT ATCATTGAAG
TTTAAAGATG CCGATCAGAT ACCTGCGTGG GCAGCGGGTT ATGTACAGGC AGGAGTTGAG
GCAGGTATCA TCGCAGGATA TGAAGACAAT ACATTCAGAC CGTCAAGAAA TCTGACCAGA
GAAGAAATGG TTGTATTAAT AATGAAGGCT TATGAATTTG GAGCTGTAGA AAATCCTGAG
TTTAGCTTCA TAGATGCAAG TGAGATTGGA GATTGGTCCA AGCCGTTTGT TGGAAAGGCT
GTTGAATTGG GATTTGTTGT AGGATATCCG GATAATACAT TTAAGCCTAA GAAGAGTGTA
ACCAGAGCTG AAGCATTTAC AGTTCTCTGG AAAGCAATAG AGGCTAAAGA AGCGGCAAAT
GCCCCTGCTG AAGAGGCTGA AGTTGCTGAA TAA
 
Protein sequence
MKKFHNKQLQ NIICSIIAIV MIFGVIATVA YAEVVQPDAF QKVVLQVISL TEQQRINFVN 
NVLDQITTEN YKDYVDDAKA ILGLSMSDSD MEKALKSYAY YPDTHKGTLK ELIKIFELPS
NQYDTSAFSD IEKRINFNIT GDSNDSRGFK LFVEIFKAIR KFNNAPVFYD GSDDIYKLDI
TVQGNNTLKN EINALISSVE SLTKKDINDF DSFTEYIENE INSNSYGQIY GLKVFLKDEL
GSSTYAGTLP EPSEIDPLQK VFLAIISLPQ KDRKAFVENV LVKLTPDNYA DYVYEAKKIL
GVTITDGQMK AALKVYASYS ETYRAKIEGM ILAFDLSAIK IDTSLFADLA ADINFEVTGD
ANDDRGIKFV VNTLDWLSQF TGPIVFDGTE VPYKVDFRIQ GNATMKRHLD GLIKLIKSLE
KRGVTNFDSF LVLAEDIVNA NDNIQIYNFK KLLRDLYGRS VYDGELPYPP VTPTPAPTSG
GGSGGSGGSG GGSTATPAPT PTPTSTSIEE PTPSDVPAAP FNDIAGHWAE EFIAKLAARN
VVSGYPDGSV KPDIEITRAE MAVIVVKSAG LEPVENVSLK FKDADQIPAW AAGYVQAGVE
AGIIAGYEDN TFRPSRNLTR EEMVVLIMKA YEFGAVENPE FSFIDASEIG DWSKPFVGKA
VELGFVVGYP DNTFKPKKSV TRAEAFTVLW KAIEAKEAAN APAEEAEVAE