Gene Cthe_0561 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0561 
Symbol 
ID4808236 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp688931 
End bp689977 
Gene Length1047 bp 
Protein Length348 aa 
Translation table11 
GC content40% 
IMG OID640105975 
ProductApbE-like lipoprotein 
Protein accessionYP_001036990 
Protein GI125973080 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG1477] Membrane-associated lipoprotein involved in thiamine biosynthesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAAAAA GATTTACGGT AATTATTTTA AGTATAGTTT TGTGCACAAG TGTTTTGTTT 
GTATCATGCG GCTATAATTC TTCCGATTTG TATGAAACTC AGGAATTTTT AATGGGGACT
GTTGTTTTAC AGAAAATATA TCATGAAAAT GCCGCTGAAA TTGCAAAAGA GGTAAATGAC
AGAATAGCCG AAATTGAATC GACCATGACA ATAAACAAGC CCGGTGGGGA AATAAATCTT
TTAAACGACG CAGCGGGAAA AGAATATGTA AAACTTGGCG AGGATACTCT GTATGTGCTT
GACAAAGCAA AACAATATGC AGAGATTAGC AATGGAGCCT TTGACGTTAC TATAGGTCCT
TTGGTAAAAG CGTGGGGTGT TTTTACAGAC AATCCGAGGG TTCCATCGAA AAATGAAATT
GATGAGCTTT TAAAACTGGT AAATTATAAA GATATAAATA TTGACTTTGA AAATTCAACG
GCTATGCTGG CAAAAGAAGG ACAAATTGTG GATCTTGGCG GAATTGCAAA GGGATTTGCC
GCGGATGAAG CGGTTGAAAT ATACAAAGAA CACGGTGTAA AATCTGCGTT GATAAGCCTT
GGAGGCAACA TTTTTACATT GAGCGGCAAA CCTGACGGAA GTCCCTGGAT GGTGGGCATA
AGAAATCCCA GAGGTAACGA TGGTTCGTAT ATCGGGATTG TTAGGGTGAA AGACAAAGCG
GTAGTCAGTT CCGGTGACTA TGAGAGGTTT TTTGAAAAAG ACGGTGTGAG ATATCACCAT
ATTTTGGACC CCAAGACCGG CTATCCTGCT GATACGGGAC TTATTGGGAC CACTATTATT
TCGGACTTTT CAATTGATGC CGATGCTCTT TCGACAGCGG TTTTTGTGCT GGGTCTTGAG
GAAGGCATGA AACTTGTTGA AAGCCTTGAT GGGGTGGATG CGGTGTTTAT TACCGCGGAT
AAGAAAATAT ATGTAACGGA CGGATTGAAG GATACATTCA TATTTAAGGA TGAAAGCAAG
GAATTTGAAT ATGTTGAAAA AAGGTGA
 
Protein sequence
MTKRFTVIIL SIVLCTSVLF VSCGYNSSDL YETQEFLMGT VVLQKIYHEN AAEIAKEVND 
RIAEIESTMT INKPGGEINL LNDAAGKEYV KLGEDTLYVL DKAKQYAEIS NGAFDVTIGP
LVKAWGVFTD NPRVPSKNEI DELLKLVNYK DINIDFENST AMLAKEGQIV DLGGIAKGFA
ADEAVEIYKE HGVKSALISL GGNIFTLSGK PDGSPWMVGI RNPRGNDGSY IGIVRVKDKA
VVSSGDYERF FEKDGVRYHH ILDPKTGYPA DTGLIGTTII SDFSIDADAL STAVFVLGLE
EGMKLVESLD GVDAVFITAD KKIYVTDGLK DTFIFKDESK EFEYVEKR