Gene Cthe_1576 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1576 
Symbol 
ID4809567 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1904910 
End bp1905977 
Gene Length1068 bp 
Protein Length355 aa 
Translation table11 
GC content47% 
IMG OID640106994 
Productbasic membrane lipoprotein 
Protein accessionYP_001037995 
Protein GI125974085 
COG category[R] General function prediction only 
COG ID[COG1744] Uncharacterized ABC-type transport system, periplasmic component/surface lipoprotein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones58 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAAT TTCTGGCATT ACTATTATCC GTTATAATAG TCTTTTCCCT GACTGCATGT 
GGCGGAAAAA ATTCCGGCAA CAACAGCAAT AACAGCAGTA ATAACAACAG CAGCAACAAT
ACCGGGGGCA AGAAGATTAA GATTGGTATG GTTACCGATG TTGGCGGTGT AAACGACGGC
TCATTCAACC AGTCTGCCTG GGAAGGTCTG CAGCGCGCTC AAAAAGAACT TGGTGTAGAA
GTTCGCTATG CCGAATCTGC AACCGATGCC GACTATGCTC CCAACATTGA GGCTTTCATT
GATGAAGGCT ATGACCTCAT CATCTGTGTA GGATACATGC TGGCTGATGC CACCAGAAAA
GCAGCTGAAG CCAATCCAAA TCAGAAATTT GCCATCATTG ACGATGCTTC CATCGATTTG
CCCAACGTTA CCTGCCTGAT GTTCGAGCAG AGCCAGGCTT CCTACCTGGT TGGCCTTGTT
GCCGGTAAAA TGACCAAAAC AAACAAAGTA GGATTTGTTG TCGGTATGGT CAGCCAGACC
ATGAACGAAT TCGGTTACGG GTATCTTGCC GGCGTGAAAG ATGCCAACCC CAATGCTACT
ATCCTGCAGT TCAACGCTAA CTCTTTCAGC AGCACCGAAA CCGGTAAATC CGCTGCTACC
ACAATGATCA CCAACGGCGC GGATGTAATC TTCCACGCAG CTGGCGGAAC GGGCTTAGGC
GTAATCGAAG GCTGTAAAGA CGCAGGCAAA TGGGCAATCG GTGTAGACAG CGACCAGTCC
CCGCTTGCTC CTGAAAACAT TCTGACCTCT GCTATGAAAC GCGTTGACAA TGCATGCTTT
GATATTGCCA AAGCCGTAAA GGAAGGCAAT GTTAAGCCTG GCATCATCAC GTATGACTTA
AAGTCCGCAG GTGTAGACAT CGCTCCTACC ACCACCAACC TGCCAAAGGA AGTTCTCGAT
TATGTAAACC AAGCTAAGCA GGACATCATC AACGGTAAAA TTACCGTTCC GAAGACCAAG
GCTGAGTTTG AAGCAAAATA CGGCAACATA TACGAATTAG ACGACTAA
 
Protein sequence
MKKFLALLLS VIIVFSLTAC GGKNSGNNSN NSSNNNSSNN TGGKKIKIGM VTDVGGVNDG 
SFNQSAWEGL QRAQKELGVE VRYAESATDA DYAPNIEAFI DEGYDLIICV GYMLADATRK
AAEANPNQKF AIIDDASIDL PNVTCLMFEQ SQASYLVGLV AGKMTKTNKV GFVVGMVSQT
MNEFGYGYLA GVKDANPNAT ILQFNANSFS STETGKSAAT TMITNGADVI FHAAGGTGLG
VIEGCKDAGK WAIGVDSDQS PLAPENILTS AMKRVDNACF DIAKAVKEGN VKPGIITYDL
KSAGVDIAPT TTNLPKEVLD YVNQAKQDII NGKITVPKTK AEFEAKYGNI YELDD