Gene Cthe_1807 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1807 
Symbol 
ID4809791 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2141116 
End bp2142780 
Gene Length1665 bp 
Protein Length554 aa 
Translation table11 
GC content46% 
IMG OID640107221 
Producthypothetical protein 
Protein accessionYP_001038221 
Protein GI125974311 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG2834] Outer membrane lipoprotein-sorting protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACAAAA ATGAAAAGAA ATTATCCGAA TACATTGATA AATTAAATGC CGAGAAAATG 
CCTGACGAGC ACGAGTGTCT GCCGGATTCA CCGGAATTGG AGGAACTTAT GGATACGGTA
AGAAAAATTC GAAGTCTGAA GGAGCCTGCT CTGCCGGATG CGGATTATCC AAAAAAGCTG
GCCCGGGTAG TCAGTGCTCA ATTATCGCAA AAATCCGCCG CCGGAAAAAG AAAATGGACA
TGGCTGGCCG GAGCGGCTGC TGTTGCGGCA GTTGCTGTCC TGGTTTTTGT ACTGAATTTT
GTACTGTATT CCGGCAGAAC CGACATTGTA TACGCCATGG AGCAGGCATA TAAGGAAGTT
AAAGCATATC ACGGAATCCT CAGCATTGTT GAAACCAATC TCAATGGAGA AGAGACTTTG
CAGGCAATGC GGGAGGTTTG GGCGGACAGC GAGGGACGCT ACTATGTAAA AGAGCTTCAG
GGCTTTCAGA AAGGCTTGAT AACCGTAAAC AACGGCGAAA AAAAGTGGCA GGTGAGTCCT
GCTGAAGAAC AAGTATACAT CTTTCCATCA TTCCCCGATC CATACAAATT CACCTTGGAA
CTTGGCAATG AAATAAAAGA TGCCAAAAAT GCCGAACAAA TCAAAGCCGT GGGAGAAGAG
ATGGTTGCGG GAAGAGAAAC CTCTGTATTT GAGGTACTGC CCAGAGGAGG GGAATCCTAC
AAAATATGGA TTGACAAGGA GACGAATCTG CCGCTTCAAA AAGAGAGTGC TATGATGAAT
GCAATTCAAT ACAGGGTAAC CTATACCAGC ATTGAGTTTG GCGACAATAT ACCCGGTGAG
CTTCTTGCTT ATAGCTTGCC GCAAGGCTTT AAGGAAATAG ATAAGAATCC CGAACTGCAG
GTCGGCAGCG TTGAAGAAGC TGCGGAAACA GCCGGTTTTA CTCCCCAAAT ACCCCAAAAT
GTTCCCGGGG GATATACAAG AAACGGCATG GCAGTTACAG GGGATATGAA AACCGTCAAG
CTAAGCTATA TATCCCAGGA TAAGAAAAGC CGGGTAATTA TTTTGCAGAA AAAAGCAACG
GATGAGTTTA AACCTGCATC AACAGCGGTT TTAGGCAAGG TGGGCGGCAA TACTGCCGAA
ATTCAGTCTC CTGTGCAGGA CAGTCCTGGA GTGCTTGAAG GAGGAATGTA TTCAGGGATG
GCGGATATCC GCTCGATTCG CTGGCAGGAA TCCGGATTTG AATATGCTGT GATAGGCGAT
GCGCCAATGA ATGAATTGAT TTCATTCATT GAAAGTATAA CAACAGGTCC GGTTGAGATA
CCGCCGGAAA ACGAAGAAAC CCCAGAGAAG CCTCAGATTG AAGTTCCGGT TGATCTGAAA
GTCGAGAAAA ATGAGCAAAA AAGCGTGGAT GCGGGACATT CACCGTGGAA ACTGGATCCT
GTTTATGTCG CACAAGTATT TGTAAGCCTG AAAATTTCTC CTGAAGGCAT TGAAGGAGAA
TATCCGGTAA GTTATGAAGA CATGGAGGTT GTAAAAAACA ACGGCATAGA GGCGGTAGTG
GAGATAAGCG GTGATAACAC ACCTGTGCGC AGGGTTTATT TAAAAAGACT GATAAGACAG
GACAGCACGG GAATATGGAC TGTGGTCGGA TATGATCCGG TTTAA
 
Protein sequence
MDKNEKKLSE YIDKLNAEKM PDEHECLPDS PELEELMDTV RKIRSLKEPA LPDADYPKKL 
ARVVSAQLSQ KSAAGKRKWT WLAGAAAVAA VAVLVFVLNF VLYSGRTDIV YAMEQAYKEV
KAYHGILSIV ETNLNGEETL QAMREVWADS EGRYYVKELQ GFQKGLITVN NGEKKWQVSP
AEEQVYIFPS FPDPYKFTLE LGNEIKDAKN AEQIKAVGEE MVAGRETSVF EVLPRGGESY
KIWIDKETNL PLQKESAMMN AIQYRVTYTS IEFGDNIPGE LLAYSLPQGF KEIDKNPELQ
VGSVEEAAET AGFTPQIPQN VPGGYTRNGM AVTGDMKTVK LSYISQDKKS RVIILQKKAT
DEFKPASTAV LGKVGGNTAE IQSPVQDSPG VLEGGMYSGM ADIRSIRWQE SGFEYAVIGD
APMNELISFI ESITTGPVEI PPENEETPEK PQIEVPVDLK VEKNEQKSVD AGHSPWKLDP
VYVAQVFVSL KISPEGIEGE YPVSYEDMEV VKNNGIEAVV EISGDNTPVR RVYLKRLIRQ
DSTGIWTVVG YDPV