Gene Cthe_3064 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_3064 
Symbol 
ID4809938 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp3604489 
End bp3606096 
Gene Length1608 bp 
Protein Length535 aa 
Translation table11 
GC content40% 
IMG OID640108488 
Productpolysaccharide biosynthesis protein 
Protein accessionYP_001039453 
Protein GI125975543 
COG category[R] General function prediction only 
COG ID[COG2244] Membrane protein involved in the export of O-antigen and teichoic acid 
TIGRFAM ID[TIGR02900] stage V sporulation protein B 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000038595 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGAAAC AGTCAATCAC AAAAGGTTTT GCGGTACTGT CGGCGGCAGG ACTTATAACA 
AAAATATTAT CGGTGCTTTA TATTCCTTTT TTGCTTGCCA TTATCGGAGA TGAGGGAAAT
GGTATTTATG CTGCCGCGTA TCAAGTATAT GTTTTTATTT ATGTTATTGC CAATTCAGGC
ATTCCCGTTG CGATAGCAAA GTCCGTGTCT GAGCTTACCG CTGTGGGAAA CTACAAGGAT
GCTTTGAGAA TCTTTAAAAT ATCGCGTTTT TTCCTTATTA TAATAGGTAC TGTTTTAACA
GTGCTTATGT TTGTCACGGC AAAGCCATTG GCTGTCATGA TTAACTCGGA AAAATCATTC
CTTGCAATAG CGGCATTGTC TCCGACACTG TTTTTTACCG CCCTGGCCTC TGCGTACAAG
GGATATTTCC AGGGCATGAG CAACATGACT CCGACTGCCG TGTCCCAGGT GGTTGAACAG
ATATTCAATA TGATATTCAC AGTGCTTTTT GCAGCGTTGC TAATAAATAA AAGCCTTGAG
GCTGCGTGCG CCGGAGGTAC CGTAGGAACA ACTGTGGGTG CGCTGGCTTC CGTTATTGTC
CTTATATTTA TATACAACAG AAGAAGAGAA GAAATTAACA ATCTGAAGGA ACACAGGAAG
ACTGCAAAGA GATACTCATA CAAGCAGCTT GCGACAAGAA TATTTTATTA CAGCCTACCC
ATAACTGTTT GTGTGGCTGC TCAATATGCT GGAAATCTCA TTGATGTGGC AAATATAAGA
GGGCGTCTTT TGGCCGGCGG CTATACGCTG GAAATGGCAT CGGTCATGCA CAGCTATTTG
TCCAAATACC AGCAAATAAT GAACGCACCG ATTTCCATAG TTTCGGCTCT TGCGGCGGCG
GTGCTGCCTT CCATTTCGGG AGCTGCGGCG GAACAAGATA TAAAGCAGGT TAAGGATAAA
TCCAACCATG CTTTCAGGCT TTGCATGCTG ATAGTAATTC CGTCGGCTGT GGGGTTGTCC
ATATTGAGTG AACCTATTTA CGCCGTATTG AAATACGGAG CGGGTTCCCA CCTTATGCGC
TACGGCTCAA TAGTACTCGT TCTCATGTCC ATTGTACAAA TACAGTCGTC AATTTTGCAG
GGTGCAGGAA AACTGTACAA AGCAACGATA AATGTAATTT TAGGTATTAT CGCAAAGATA
ATTTTCAATT ATATACTTAT AGCAAATCCC AATATAAATA TCATGGGAGC AGTGATAGGA
AGTATAGTGG GATACGGTTT GACCATTATT CTCAATGTTA TGACAGTAAG AAAAGAGTTG
AAAATAAAAA TAAATATACT GAAACAGGCG GTAAAACCGG CTGTTTCATC AGTGGTAATG
GGTATTTTTG TATGGATTGT ATACAAGGGT TTATACTTTG TTTTAGGATT TATTAAGAGC
GCATATCTTG TAAACGCATT ATCTACAGTT GTTTCAGTTC TGTTCGGAAT GGCAATATAT
TTTTATATAA TGATACTTGT CAGGGGAATA ACAAAAAATG ATTTTGACGT ATTGCCGGAA
AAAATCAGAA GAATGATACC CAAATTCGTA TTAAACAAAG CCGTATGA
 
Protein sequence
MKKQSITKGF AVLSAAGLIT KILSVLYIPF LLAIIGDEGN GIYAAAYQVY VFIYVIANSG 
IPVAIAKSVS ELTAVGNYKD ALRIFKISRF FLIIIGTVLT VLMFVTAKPL AVMINSEKSF
LAIAALSPTL FFTALASAYK GYFQGMSNMT PTAVSQVVEQ IFNMIFTVLF AALLINKSLE
AACAGGTVGT TVGALASVIV LIFIYNRRRE EINNLKEHRK TAKRYSYKQL ATRIFYYSLP
ITVCVAAQYA GNLIDVANIR GRLLAGGYTL EMASVMHSYL SKYQQIMNAP ISIVSALAAA
VLPSISGAAA EQDIKQVKDK SNHAFRLCML IVIPSAVGLS ILSEPIYAVL KYGAGSHLMR
YGSIVLVLMS IVQIQSSILQ GAGKLYKATI NVILGIIAKI IFNYILIANP NINIMGAVIG
SIVGYGLTII LNVMTVRKEL KIKINILKQA VKPAVSSVVM GIFVWIVYKG LYFVLGFIKS
AYLVNALSTV VSVLFGMAIY FYIMILVRGI TKNDFDVLPE KIRRMIPKFV LNKAV