Gene Cthe_1269 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1269 
Symbol 
ID4809774 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1543014 
End bp1544297 
Gene Length1284 bp 
Protein Length427 aa 
Translation table11 
GC content44% 
IMG OID640106692 
Producthypothetical protein 
Protein accessionYP_001037694 
Protein GI125973784 
COG category[S] Function unknown 
COG ID[COG2006] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones38 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGTCAA AGGTAGCAAT TGTCAGAACC AAACCGTCCA CGGTACTTGA AGATTATCAT 
AGACTGATGA ATCTTGCCGA ATATCAAAAC TATATTTCCA AAGACGTTGA CACTGCACTT
AAAATAAATA TAAGCTGGCA TTTTTTCTAC CCGGGAAGTT CAACAACGCC CTGGCAGCTG
GAAGGTGTTA TCCGGGCTTT AAAACGTGAC GGATATAATC CTGAGCTTAT TCACGGCTGC
CACAACAGAA CGGTGGTAAT TGACGCTCAC CTGGGAGAGC GGGAAAACAA ACAGATTAAT
GTTATAAAAG CCCACAATCT TCGAAACATT CATCTCTACG AAGGCGAGGA ATGGATTGAC
ATAAGGGAGG CAGTGGGAGA TCTTACAAAA AAATTTTTGT GTCTGAACGA AGTTTATCCG
AAAGGATTTT CCATACCCAA AAGGTTTATT GGTGAAAACA TTATACATCT TCCCACTGTG
AAAACCCACG TATTTACCAC CACTACCGGA GCAATGAAAA ACGCTTTCGG CGGACTTTTA
AATGAAAAAA GGCATTGGAC TCACCCGGTG ATTCATGAAA CTTTGGTGGA TCTGCTTATG
ATACAAAAAA AGATCCACAA AGGGATTTTT GCAGTTATGG ACGGAACTTT TGCAGGAGAC
GGACCGGGTC CCCGCTGTAT GGTTCCCCAT GTAAAAAACG TGCTTTTGGC TTCGGCGGAT
CAGGTAGCTA TCGATGCCGT GGCGGCAAAA CTGATGGGTT TCGACCCGCT AAAGGACTGT
AAATACATCC GTTTAGCCCA TGATGCAGGC CTTGGCTGTG GCGACGTAAG ACAAATCGAA
ATTGTCGGAG ATGTTGACGC CTTGAATGAA AACTGGAATT TTGTCGGCCC CTATAAAAAA
ATGACCTTTG CAAGCAAATG CCAGCACCTG ATTTACTGGG GACCTTTGAA AAAGCCGGTG
GAATGGACAT TAAAAACAAT CCTGGCCCCC TGGTCCTACA TTGCCAGTGT TGTTTATCAT
GATATGTACT GGTATCCGAA AAACTATGGC AGGGTCGAGG AAATTCTAAA TTCGGACTGG
GGACGGTTGT TTGCAAACTG GGAGCAGCTT CAGCTGCCTG CGGATGACCT GTCGGTTCCG
GGCTGGGAGC ACGTCGGAGA CAAACCGCTA AAACTTGACA AAGAAACAAG TAAAATGATA
CGCAAAGCTT TCAGAGTTCT TGGAACCGCA ATCAGGGAAG CTCCGGAGTT TAGCGCAAAG
AAATCAAAAA AAGCATGCAA ATAA
 
Protein sequence
MKSKVAIVRT KPSTVLEDYH RLMNLAEYQN YISKDVDTAL KINISWHFFY PGSSTTPWQL 
EGVIRALKRD GYNPELIHGC HNRTVVIDAH LGERENKQIN VIKAHNLRNI HLYEGEEWID
IREAVGDLTK KFLCLNEVYP KGFSIPKRFI GENIIHLPTV KTHVFTTTTG AMKNAFGGLL
NEKRHWTHPV IHETLVDLLM IQKKIHKGIF AVMDGTFAGD GPGPRCMVPH VKNVLLASAD
QVAIDAVAAK LMGFDPLKDC KYIRLAHDAG LGCGDVRQIE IVGDVDALNE NWNFVGPYKK
MTFASKCQHL IYWGPLKKPV EWTLKTILAP WSYIASVVYH DMYWYPKNYG RVEEILNSDW
GRLFANWEQL QLPADDLSVP GWEHVGDKPL KLDKETSKMI RKAFRVLGTA IREAPEFSAK
KSKKACK