Gene Cthe_0440 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0440 
Symbol 
ID4808368 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp554592 
End bp555995 
Gene Length1404 bp 
Protein Length467 aa 
Translation table11 
GC content40% 
IMG OID640105854 
Producthypothetical protein 
Protein accessionYP_001036871 
Protein GI125972961 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0769] UDP-N-acetylmuramyl tripeptide synthase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATATAA GACTTATTTT CACAATAATT GTTACAAAAC TCCTTATTCT TGCATTAAGA 
ATTTTAAAAA GGGGAGGAAC TTCTCTTCCG GGAAAAGTCG CGTATAAAAT TTACCCGGAC
ATAATTAAGG TAATTTCAAA AGATTTTAAA ATAATAATGG TTACGGGAAC AAACGGCAAG
ACCACCACTA CCCGTATAAT CGGAAAAGTA CTTGAGGAAA ACAATATAGA GTACATTACC
AACAAATCCG GTGCCAATCT GGTAAGCGGT ATTATTACCA CTTTTATTGA ATCTGTAAAT
ATTTTCGGAA AAAGCAAAAC TTCCACAGCA TTGCTGGAAG TTGACGAGGC TGCCTTCAAT
GTGGTAACCG ACTACGTTCA GCCGGATGTT TTGGTGGTAA CAAACTTTTT CAGGGACCAG
TTGGACCGAT ATGGTGAGCT CTACACCACT GTAAGCAATG TCAGATCAGG CATCGAAAAG
TCACCGAATG TCAAGCTGGT TTTAAATGCA GACGACTCTC TTTGTGCATC ATTAGGTCAC
AATATGGACA GAGAGGCCAT ATACTACGGT TTTTCTGAGG AAGCGTACAA CAACAGCAGC
ACAGTTGTAA ACAGTGATGC AAGCATTTGC CTTTACTGCA AAAGCAAATA TGAGTACTCC
TACAATGTAT ATGGCCATCT TGGGGGATTT TCCTGTCCAA ACTGCGGATA TATGCGACCT
GATTCCAAGG TAACATGCGT AAAAATTAAC GAGCTTAACA CTTCGTATTC CGATATCATA
TTTTCATTAA GTCCAATAAA GGGCAATGAC GAGCCGGTTT CATACAACGC CAGAATTAAT
CTTCCCGGAC TTTATAACAT ATATAACGCA CTGGCTGCGG CGTCCCTGGG ACATCTTTCA
GGCTTTTCGC CGGAAAGCCT TGTAAAAGCC ATGGAAAGTT TCGAATGCGG TTTTGGACGA
ATGGAAACCA TCGAAACCGA CGGTAAAACC ATAAAGGTTA TTCTGGTTAA AAACCCAACC
GGCTTTAATC AGGTTTTAAG TTATCTTCTT ACGGAAAAGC AAAATACTCA AATAGCCTTT
GTTATAAATG ACCGCCTTGC AGACGGCACC GATATTTCGT GGCTGTGGGA TGTTGATTTT
GAGCAGCTTC AGCAAATGCA GGACAAAGTA TCCAGCTTTT ATGCTTCAGG AATCCGTGCG
GAGGATATGG CTGTAAGGCT TAAATATGCA GGAATAAACA TCGATAAGAT TCAGATTGAA
AAAGACTATG AAGAGCTTCT CAATAAAGCT TTAGCCACTA CTTCAGAAGG GCAGAATCTT
TATATACTGC CCACCTATAC CGCAATGCTT GAAGTAAGAA GCCTTCTGGA GAAAAAATTC
GGTTTAAAGG AGTTTTGGAA ATAA
 
Protein sequence
MNIRLIFTII VTKLLILALR ILKRGGTSLP GKVAYKIYPD IIKVISKDFK IIMVTGTNGK 
TTTTRIIGKV LEENNIEYIT NKSGANLVSG IITTFIESVN IFGKSKTSTA LLEVDEAAFN
VVTDYVQPDV LVVTNFFRDQ LDRYGELYTT VSNVRSGIEK SPNVKLVLNA DDSLCASLGH
NMDREAIYYG FSEEAYNNSS TVVNSDASIC LYCKSKYEYS YNVYGHLGGF SCPNCGYMRP
DSKVTCVKIN ELNTSYSDII FSLSPIKGND EPVSYNARIN LPGLYNIYNA LAAASLGHLS
GFSPESLVKA MESFECGFGR METIETDGKT IKVILVKNPT GFNQVLSYLL TEKQNTQIAF
VINDRLADGT DISWLWDVDF EQLQQMQDKV SSFYASGIRA EDMAVRLKYA GINIDKIQIE
KDYEELLNKA LATTSEGQNL YILPTYTAML EVRSLLEKKF GLKEFWK