Gene Cthe_2506 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2506 
Symbol 
ID4809445 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2973047 
End bp2976088 
Gene Length3042 bp 
Protein Length1013 aa 
Translation table11 
GC content44% 
IMG OID640107922 
ProductS-layer-like domain-containing protein 
Protein accessionYP_001038901 
Protein GI125974991 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0810] Periplasmic protein TonB, links inner and outer membranes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGAAA AGATTGGAAA AGTTTTGAGG CAAAACAAGT CTATAATTTC CGTTGTAGTT 
ATAACAGCGA TTTTGTTTGT TTATAATACC GGAATGCTTT TTACGGGAAT ATGGGAAGGA
GCACTGTACA ATGTGAAGGC TGAGGAAGTT CCTTTAAAGC TCGAGTTTTT CAACAATGTG
AAGGATGACA ATGTAACCCT TATAAGTCCG TATTTTCGTG TTATCAATAA TAGTTCATCT
GATGAAATTT ATTTGCAGCA TGTGAAGATT AGGTATTATT TTACACTGGA CAGCTCGGAC
AGCGAAGAAA CCATGAATTA TGAGATTTAT TATGCGGGTA AAAGTAATAT AGACGGTACT
GGAGCAGTGG AAGATATAAA GCCGAATACA ATCGTTAAAA TTGCCAAAAT GGATATACCG
ACTGATATGG CAGACCATTA CCTTGAGATT GGATTTGACG AAAGTTGCGG AACCATAGGG
CCGGATAAAA AAGTTGAAGT TATGGTAAGT ATTTCGAAGG AGAAATACAA GAAGTTTATA
CAGACAAATG ACTACTCCTA TAATGATTCC GCCGAAAATT ACGTATCGTG GGAAAAAGTA
ACTCTATACC TGGATGGTGA GCTGATTTCA GGAATTGAGC CTAATATGTA TGCAAGCAGG
GAAACCGGTG CATGGTATAT GTTTGATGAA GCTGTCGAAG GTTCAACGAA CGAATTTAAG
GACTATAAAG GTAATCACGG CAATGCGGTA CTGTATTCGG CAAACGGTGT TGTGCCGGGA
TTGAACGGAA ACAGTGTGTC CCTGGACGGA GTTGATGATT ATGTTGCTTT GCCTGACGGA
ATAGCCGGTA CTTTCTACAA CTTTACTATA GCTTTCTGGG TGAGGCTGGA CACCATAGGT
GAACAGCCGA TATTTGATTT TTTCGATAGT GGTTCCAACA ACAAATATAT GCGTTTAACT
GCCGAAAGTG ATGGAAAAAT TAAGTTTGCA ATGACGCAGT CAGGTTATTA TGGTGAGAAG
ACCATTACTT CAGGCTCGGC TTTGACTGAA GGTGTCTGGA AGCATGTGGC AGTTACCTTG
TCCGGAGACA CCGGTACTTT GTATATTAAT GGAGAGAATG TTGGGGAAAA TAACACGCTT
TCTTTAAGAC CTTTAACATT TCTGGGAGAA ACTTCAAAAG GCTATATTGG AAAATCCCAT
CAAACAGATT CATCGGAAGA TCCATACTAT AATTCATATC TTCATGGAAT GATAGATGAT
TTCCGGATTT TCGACAGGGC TTTAAGTGCT GATGAAATTA AGACACTTGC AAGTGTTGCA
ACAAGGGTAA ATGATTCGGA TCCGGGAATA CATTACAGCA GTGGCTGGAG CCATAGCCAG
GAAAGAGATA AAGGTGATTA CTTAAATGAC GTTCATGAAC TTGATTCACC GGACGGAGAG
AACTGTTTTG AGTATACATT TACCGGAACC GGAGTGAATG TCATAGCTCC CCAATGCAGT
GATAACGGTG ACGCAGAGAT ATATATTGAC GGAAAACTTA TGAAGTCGGT GGCTATGAGC
GTTTATTCGG GCTACAATTC CCAGGCAGTG GTTTACAGTA AACTGGGACT GTCACTCGGA
ACCCATACAA TAAAAGTTGT ATTTAAAAAT GGTATTGGAA TTATAGACGC TTTAGACATA
ATGACAGGTG AAATAGTAAG TCCGAGTCCG ACGCCGACGC CAAGCCCAAC ACCGAGTCCG
ACACCGACGC CGAGCCCAAC ACCGAGTCCG ACACCGACGC CGACACCAAC ACCGAGCTCG
ACACCGACAT CGACACCGAC ACCGGAACCG AGTCCGACAT CGACGCCGAC ACCGGAACCG
AGTCCGACAT CGACGCCGAC ACCGGAACCA GAACCGGAGC CGACATCGAC GCCGACACCG
GAACTGAGCC CAAGCTCGAC TCCAGTACCG ACGCCGACAC CGACTCCAAC GCCAGCGCCA
AACCCTGCAC CGGAACCGGT ACCAATATCA ACACCGGTAC CGGAACCAAT ACTGATTCCG
ACTCCAACGC CAACCATGAC ACCGATGCCG ACACCGACCC CAACTCTTGA GGTCAAAAGC
GATCCATACC TTTCCGACCT TGTTGTGACA GGAGCAAAGC TTAAACCTGC GTTTGTTCCG
GATATACTGA ATTACGAGGC AGTGGCGGAG GAAGATGTAA GGTTTGTGTG TATTGTTGCT
TATGCACGGG ATGACGGAGC TGAGATTACT TTAAACGGTG TTCCTGTTAA AAGCGGAAGT
ATATCCCATG CTGTTGAGCT TAAGGAAGGA AAAAATGAGC TGATTGTTAA AGTTGTGGCA
GAAGATGGAA TAACATCAAG AACGTACCGA ATAAGTGTTT TACTTGAGGC ATTGCAATTG
CCTACACCAA CTCCTGATAA AAGCGGCAAT CCATTTTTCT CGTCATTGGA GGATTTGCTT
AAGGAAAATG AAGTGAGTCC AGACGGGACA AAGGGTGGGA TATTTGATGA TGTGCCCAGG
GGATATTGGG CGGAAGAGTA TATTCAAAAG CTCTATGAAA AGGGAATTAT AAGCGGTATT
GATGAAAAAA CGTTCATGCC CGGAAGACCT ATTACAAGGG CCGAGTTCAC ACAGATAATT
GTTAATTCCC TGAAAATCCC TTACAGAGAA GCCGGACTGC ATTTCAACGA CGTGACTGAA
AAAGACTGGT ATTACAAGAG CGTGTCTTCC GCCGCAGCCT TTGGAATAGT TGTCGGAAGA
CCCGACGGAA GTTTTGCTCC AAATGAATTT ATAACCAGAC AGGATATGGC AGTGGTTATT
GCCAAGTTTT TGGAGAAAAA ACATGACGGA AACCTGGAAG GAATGGGAAA AGGTCTGGTT
TTTGCCGACA GCGGCAATAT CTCGGAATAT GCGCGGGATT CGGTGGCGGC TGTGGTATCT
CAAGGGTTGA TGGTCGGAAA ACCCGGCAAC ATGTTTGATC CCAAAGGGCT TACCACAAGA
GCGGAAGCGG TAACGGTCAT TTGCAAGCTT ATGAAGTATT AG
 
Protein sequence
MKEKIGKVLR QNKSIISVVV ITAILFVYNT GMLFTGIWEG ALYNVKAEEV PLKLEFFNNV 
KDDNVTLISP YFRVINNSSS DEIYLQHVKI RYYFTLDSSD SEETMNYEIY YAGKSNIDGT
GAVEDIKPNT IVKIAKMDIP TDMADHYLEI GFDESCGTIG PDKKVEVMVS ISKEKYKKFI
QTNDYSYNDS AENYVSWEKV TLYLDGELIS GIEPNMYASR ETGAWYMFDE AVEGSTNEFK
DYKGNHGNAV LYSANGVVPG LNGNSVSLDG VDDYVALPDG IAGTFYNFTI AFWVRLDTIG
EQPIFDFFDS GSNNKYMRLT AESDGKIKFA MTQSGYYGEK TITSGSALTE GVWKHVAVTL
SGDTGTLYIN GENVGENNTL SLRPLTFLGE TSKGYIGKSH QTDSSEDPYY NSYLHGMIDD
FRIFDRALSA DEIKTLASVA TRVNDSDPGI HYSSGWSHSQ ERDKGDYLND VHELDSPDGE
NCFEYTFTGT GVNVIAPQCS DNGDAEIYID GKLMKSVAMS VYSGYNSQAV VYSKLGLSLG
THTIKVVFKN GIGIIDALDI MTGEIVSPSP TPTPSPTPSP TPTPSPTPSP TPTPTPTPSS
TPTSTPTPEP SPTSTPTPEP SPTSTPTPEP EPEPTSTPTP ELSPSSTPVP TPTPTPTPAP
NPAPEPVPIS TPVPEPILIP TPTPTMTPMP TPTPTLEVKS DPYLSDLVVT GAKLKPAFVP
DILNYEAVAE EDVRFVCIVA YARDDGAEIT LNGVPVKSGS ISHAVELKEG KNELIVKVVA
EDGITSRTYR ISVLLEALQL PTPTPDKSGN PFFSSLEDLL KENEVSPDGT KGGIFDDVPR
GYWAEEYIQK LYEKGIISGI DEKTFMPGRP ITRAEFTQII VNSLKIPYRE AGLHFNDVTE
KDWYYKSVSS AAAFGIVVGR PDGSFAPNEF ITRQDMAVVI AKFLEKKHDG NLEGMGKGLV
FADSGNISEY ARDSVAAVVS QGLMVGKPGN MFDPKGLTTR AEAVTVICKL MKY