Gene Cthe_1197 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1197 
Symbol 
ID4810149 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1426715 
End bp1428130 
Gene Length1416 bp 
Protein Length471 aa 
Translation table11 
GC content43% 
IMG OID640106619 
Producthypothetical protein 
Protein accessionYP_001037622 
Protein GI125973712 
COG category 
COG ID 
TIGRFAM ID[TIGR01445] intein N-terminal splicing region 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000101518 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGCTGCCG CAGTAGTTAT CACAGGATTG GGAATAGCAG CGGCGCTGAC AGGAGGAGTA 
TTGGGAGTTA TACTGGCAGG AGCGTTCTGG GGAGCATTGG CCGGAGGCTT GATAGGAGGA
GCAGTCGGAG GAATAGCAGC GGCGATAAAT GGAGGTTCGT TCCTAGAAGG ATTTGCGGAT
GGCGCTTTAA GCGGAGCGGT TTCCGGAGCG GTGACAGGAG CGGCATGTGC CGGGCTTGGT
GCTTTGGGAG CAGCGGCAGG GAAAGGCATC CAATGCATGA GCACTGTAGG AAAAGCAATA
AATGTTACTT CGAAAGTAAC TGCAGCCCTT TCGTTGGGTA TGGACGGATT TGACATGCTG
GCAATGGGAG TATCGCTGTT TGATCCATCC AACGCATTGG TTGAATTTAA CCAGAAGCTG
CATTCCAATG CACTTTACAA TGGATTCCAG ATTGCAGTAA ACGCGCTGGC TGTTTTCAGT
GCCGGGGCGG CATCTACAAT GAAGTGCTTT GTTGCGGGTA CGATGGTATT GACAGCGGCA
GGCTTGGTTG CGATAGAGAA TATCAAGGTA GGAGATAAGG TAATTGCGGC GAATCCGGAG
ACTTTTGAAG TAGCCGAGAA GACAGTGCTT GAGACATATG TGAGAGAGAC AACGGAGCTT
TTGCATTTGA GAATTGGAGG CGAAGTAATC AAAACAACCG TTGACCATCC ATTTTATGTA
AAAGATGTTG GCTTTGTTGA AGCGGTGAAT CTGCAAGTCG GAGACAAGTT GGTTGATTCA
AAAGGCAATG TTTTGGTGGT AGAAGAGAAA AAGCTCAAAA TAACTGGTAA ACCTGTGAAA
GTTTACAACT TTAAAGTTGA TGACTTTCAT ACTTATCATG TTGGGAATAA AGGGATATTG
GTACATAATG CGAATTATAA TCCTAAAACT ACCTTTGAAA ATCTGGATTT GGAAACCGCC
AGTAACAAGC AAAAGGGTAA TTATGGAGAA TATCGAGCGG ATGATAATCT TATAAATAAT
CCAAAATTGA AGGAAGTAGG ATATGATTTG GAACAGATAG GAGGGAAAGT TCCGACATCA
CCGGATGATA AAATCACAAA AGGGATAGAT GGTATATATG TAAACAAGAA TCCTAATTCA
AATATTAAAT ATGTGATTGA TGAGTCAAAG TTTAATACTG CACAATTGGG GAAAACGAAA
AAAGGCATAA AGCAAATGTC GGATGAGTGG CTCCGTGAGA AACAAGGTAA AAGAATTTTA
CAAGCAGTTA ATGGTGATAG AAGACTGAAA GATGATATAA TAGAAGCATT AAACAACGGT
GCAGTAGAAA AAGTTTTATC ACGAGTTGGC AAGGATGGAA AAGTAACGAC GTATAGGTTA
AACAGCAATG GTGAAATAAT TGGATTCTGG CCATAA
 
Protein sequence
MAAAVVITGL GIAAALTGGV LGVILAGAFW GALAGGLIGG AVGGIAAAIN GGSFLEGFAD 
GALSGAVSGA VTGAACAGLG ALGAAAGKGI QCMSTVGKAI NVTSKVTAAL SLGMDGFDML
AMGVSLFDPS NALVEFNQKL HSNALYNGFQ IAVNALAVFS AGAASTMKCF VAGTMVLTAA
GLVAIENIKV GDKVIAANPE TFEVAEKTVL ETYVRETTEL LHLRIGGEVI KTTVDHPFYV
KDVGFVEAVN LQVGDKLVDS KGNVLVVEEK KLKITGKPVK VYNFKVDDFH TYHVGNKGIL
VHNANYNPKT TFENLDLETA SNKQKGNYGE YRADDNLINN PKLKEVGYDL EQIGGKVPTS
PDDKITKGID GIYVNKNPNS NIKYVIDESK FNTAQLGKTK KGIKQMSDEW LREKQGKRIL
QAVNGDRRLK DDIIEALNNG AVEKVLSRVG KDGKVTTYRL NSNGEIIGFW P