Gene Cthe_1328 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1328 
Symbol 
ID4809468 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1611804 
End bp1613081 
Gene Length1278 bp 
Protein Length425 aa 
Translation table11 
GC content40% 
IMG OID640106752 
Productstage II sporulation P 
Protein accessionYP_001037753 
Protein GI125973843 
COG category 
COG ID 
TIGRFAM ID[TIGR02867] stage II sporulation protein P 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.944507 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAGTAT ACGGTTTGAG AAAAAGAAAG TTTGATTATG GCAAGTTGTT TAAGATTGCC 
CTGATAATAA TACTTTCAAT CGGGGCAATA AAGATTGGGG CAATTGCCGG AGACGTGCTC
TACAAGTCTG ATAAAAAGAT TATTGGAAAG ATTGAAGTTG AAACTTTAAG AGCCACTCTC
AATGCTTCAC TTCCTATAAT TGACACCATT TACAACAGCG GCAATATCAG TTTTTCGATT
TCGGGACAGA TAAAAGAAAT TATTAATCTG GTTTTCTATT TTGATCTCAG TAATCCTGTT
ACAATTTTTG GGGCAGAGTC TCCAATATTC TATAGTTACT ATATGAATGA GTACCAAAAA
CAGCTGGCCC AAAATCAAAA CTGTGAACCT TACTTCTATA TGGCGGATTT GGATACGCCG
GATGACAATA GCGACAAGGT CAAAAATCCT GACAATAATG CTCCTGAACC CACATATCCG
GCCAGCAGCA TAAGTTATGA GGTAAATGAG TTGGACAGAA CCGGAACTCC TGAGAATGCT
ACCACTGTTA CGGCGGACAA AATTGCAATC AACAGTCATG AAGTTGACTA TGAAATTGAT
GTTGAAAAGC TTCTTAACGA ACCTTTAAAC ATCAGCTTTG ACAAAAAGGG TCCCAAAGTT
CTTATATACC ATACACATAC CACGGAAGGA TTTATTAAAG ACCTAAGCGA GCTGGATAAA
AGTGGTATTC CAAGCAGAAC CACCGATAAC AGATACAATG TAGTAAGAGT TGGGGAGGAA
CTGGCTCAGA CATTAAGGAA AAAATACGGT ATTGAAGTGA TTCATAACGC CACTGTTCAC
AATCATCCCT CGGACACAGG AGCTTATGGT AGATCCCTTA ATACTGCGGC CAACATTTTA
AAAAGTTATC CTTCAATAAA AATAGTCCTG GATATTCACA GGGACGGGCT GGGCGAAGGT
AAACTTAGAG TGGCGACCAA GATTAATAAC AAGGATGCGG CAAAAATAAT GTTTGTGGTG
GGAACCGACG GGACAGGGCT TGAGCATCCT AACTGGCGGG AGAATTTGAA ATTGGCAATC
AAGCTTCAGC AAAAGCTTAA TGAAAAGTAT CCCGGTATCA CAAGACCGAT TTATATAAGC
CGCAACCGCT ACAACCAGCA CCTTACCAAC GGTTCTTTGA TTGTTGAAAT CGGAGGGGAT
GGCAATACAA TAAATGAATG TTTGGAGAGT ACGAAATATC TTGCCGAGGT TTTAAACGAT
GTCATTAATA ATAAATAA
 
Protein sequence
MRVYGLRKRK FDYGKLFKIA LIIILSIGAI KIGAIAGDVL YKSDKKIIGK IEVETLRATL 
NASLPIIDTI YNSGNISFSI SGQIKEIINL VFYFDLSNPV TIFGAESPIF YSYYMNEYQK
QLAQNQNCEP YFYMADLDTP DDNSDKVKNP DNNAPEPTYP ASSISYEVNE LDRTGTPENA
TTVTADKIAI NSHEVDYEID VEKLLNEPLN ISFDKKGPKV LIYHTHTTEG FIKDLSELDK
SGIPSRTTDN RYNVVRVGEE LAQTLRKKYG IEVIHNATVH NHPSDTGAYG RSLNTAANIL
KSYPSIKIVL DIHRDGLGEG KLRVATKINN KDAAKIMFVV GTDGTGLEHP NWRENLKLAI
KLQQKLNEKY PGITRPIYIS RNRYNQHLTN GSLIVEIGGD GNTINECLES TKYLAEVLND
VINNK