Gene Cthe_1509 details


Gene Information

Locus tag: Cthe_1509
Symbol:
ID: 4810547
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 1833063
End bp: 1834298
Gene Length: 1236 bp
Protein Length: 411 aa
Translation table: 11
GC content: 39%
IMG OID: 640106929
Product: hypothetical protein
Protein accession: YP_001037930
Protein GI: 125974020
COG category: [S] Function unknown
COG ID: [COG2461] Uncharacterized conserved protein
TIGRFAM ID:
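
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below is illustrative only and is not part of the record: it assumes the start and end positions are 1-based and inclusive, and that the gene span includes the terminal stop codon. The function name `expected_lengths` is hypothetical.

```python
def expected_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for a CDS.

    Assumptions (not stated in the record): coordinates are 1-based and
    inclusive, and the last codon of the span is a stop codon.
    """
    gene_length = end_bp - start_bp + 1  # inclusive span
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1  # drop the stop codon
    return gene_length, protein_length


gene_len, protein_len = expected_lengths(1833063, 1834298)
print(gene_len, protein_len)  # 1236 411
```

Under these assumptions the result agrees with the listed gene length (1236 bp) and protein length (411 aa).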


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCGAAA TAATTAATAA CCGCGAATAC AGGCAGAAAG TCTTGAAAGA GTTGATTAGG 
GAGCTGCATG ACGGAAAGAG TGTCGAAGAA ATAAAACCAA GGTTTGAAGA ATTAATAAAA
GGGATTTCTC CTGCTGAAAT TTCTGAAATG GAACAGGCTT TGATTATGGA AGGCATGCCC
GTTGAAGAAA TACAAAGACT TTGCGATGTA CATGCCGCTG TTTTCAAAGG CTCCATTGAA
GAAATACACA GACCGCAGAA ACCTGAAGAA GTGCCGGGGC ATCCTATCCA TACATTTAAA
CTCGAAAATG CCGAAATAAG GAAACTTGTC GATAATGAAA TCAGACCGCA GTTGGAATTG
TATAAAAATG GAGACACAGC TGAGAGTTTA AAGAAATTAA GAGAAGCGTT TCAAAAGCTT
TGGGAGATAG ACAAGCATTA TTCAAGAAAA GAAAATCTAT TATTCCCGTA CCTGGAGAAA
TACGGCATAA CTGCACCTCC CAAGGTAATG TGGGGTGTGG ATGATGAAAT CAGGGCAGAT
ATAAAAGAGA TAAACAACAA GCTTTCAGCG AATGCTCAAA ATCAAAATAT ACCTTTGGAG
AAAGCGGAAG AAGCGGTAAA CAGGGTTATT GAAATGATTT TTAAAGAAGA AAATATTCTT
TTCCCAATGG CTTTGGAGAC TTTAACCGAG GATGAATGGG CTGAGATTGC CAGGGCGAGC
GATGAAATAG GGTATTGCAT GATTACGCCT GAAGCGGAAT GGAAACCTGC CCGGGTGGAT
GTTGTGGAAA AAACACAAAA AGAGGGGGTT AAGTCTCAAG AAAACCAGGG CTTTGTTGAA
TTTGACGCGG GATGTCTTAC AACGGAAGAA ATAAATGCAA TGCTTAATAC TTTACCCATT
GATATTACTT TTGTGGATAA AAACGATACT GTAAAATACT TTACCCAGGG AAAGGAAAGG
ATTTTTGCCC GTCCTAAAAC AATTATCGGA AGAAAGGTAC AAAACTGCCA TCCTCCGGCC
AGTGTACACA TTGTGGAAAA GATTATAGAG GATTTGAAAT CGGGAAAAAA GGACCATGAG
GATTTCTGGA TTAAAATGGG TGAAAAGTAT GTTTATATCA GGTATTTTGC TGTAAGAAAC
GAAAAAGGCG AATACCTGGG AACAATAGAA GTGACACAGG ATATTGCTCC TATACAAAAA
ATTACAGGTG AGAAACGATT GCTTTCTGAT GCTTAG
 
Protein sequence
MSEIINNREY RQKVLKELIR ELHDGKSVEE IKPRFEELIK GISPAEISEM EQALIMEGMP 
VEEIQRLCDV HAAVFKGSIE EIHRPQKPEE VPGHPIHTFK LENAEIRKLV DNEIRPQLEL
YKNGDTAESL KKLREAFQKL WEIDKHYSRK ENLLFPYLEK YGITAPPKVM WGVDDEIRAD
IKEINNKLSA NAQNQNIPLE KAEEAVNRVI EMIFKEENIL FPMALETLTE DEWAEIARAS
DEIGYCMITP EAEWKPARVD VVEKTQKEGV KSQENQGFVE FDAGCLTTEE INAMLNTLPI
DITFVDKNDT VKYFTQGKER IFARPKTIIG RKVQNCHPPA SVHIVEKIIE DLKSGKKDHE
DFWIKMGEKY VYIRYFAVRN EKGEYLGTIE VTQDIAPIQK ITGEKRLLSD A
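
The gene and protein sequences above are printed in blocks of ten characters. As a minimal sketch (not the database's own tooling), the code below strips that spacing and translates DNA codon by codon, so a stretch of the gene sequence can be compared with the protein sequence. It uses the amino-acid assignments of NCBI translation table 11, which are identical to the standard code; alternative start codons are not handled. The names `clean` and `translate` are hypothetical helpers.

```python
from itertools import product

# Amino-acid assignments for NCBI translation table 11, listed in
# T, C, A, G order for each codon position ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at a stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence shown above.
first_line = "ATGAGCGAAA TAATTAATAA CCGCGAATAC AGGCAGAAAG TCTTGAAAGA GTTGATTAGG"
last_line = "ATTACAGGTG AGAAACGATT GCTTTCTGAT GCTTAG"

print(translate(first_line))  # MSEIINNREYRQKVLKELIR
print(translate(last_line))   # ITGEKRLLSDA
```

Both results match the first twenty and last eleven residues of the protein sequence listed above.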