Gene Cthe_1761 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1761 
Symbol 
ID4810191 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2082568 
End bp2083776 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content40% 
IMG OID640107174 
Producthypothetical protein 
Protein accessionYP_001038175 
Protein GI125974265 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG4591] ABC-type transport system, involved in lipoprotein release, permease component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000119528 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCTTTC TTGAAAGTAT CAGACAGGCG CTGGACAGCC TTAAAGCAAA CAAGCTGAGA 
TCCATTCTTA CCATGCTGGG AATTGTCATG GGTGTGTTTT CGATTATTAC CATAATGGCG
ATTGGAAATG CCACTGAAGA GTATATTAAC AGCCAATTTG AAAAAATCGG TGCCAATGTG
CTTACCGTAG GCTACAAAAA TATGAATGTC GACAGTGATG AGATGCTGTA TCTTAAGGAT
ATTGAAACAG TGAAAAGGGC TGCGCCGGAA ATAAAAAATG TTACAACTTA TATTCAGCAT
AGGGGAACAC TGCGAATTGA TACAAAAACA AGAAGTGCTT TGGTGTATGG AACCACGGCC
CAGTACAAGG ACATTACGCC TATGGAGATG GCTGCGGGAA GATTTTTCAC TGATTTTGAC
ATATCTTCGA GACAGAAGGT TGTAGTTGTT GATGAATACT TTGCAAAAAG ATATTTCAAC
AGGCTGGACA TAGTAGGTGA AGTCCTGCAA TTTAAAGCGC CTTCCGGGAA CTATAAGGTA
AAGGTGATTG GGGTTGCAAA AGCTATTAAT GACGCCATGG CAAATTTATT GGACAATGAA
AACTTTCCGA CTCAAATATA TATGCCCATT ACCACGGTTC AGCAAATGTA TTACAATAAT
GAAGCTTTAG ACAGTATTCT TGTTGCGCTG GACAAGGAAG TAGACTATAA GGAAGTCGGG
AACAGAATAG TAAGGGCTTT GGAGATGAGC AAAGGTAAGA AAGACATATA TATGACATAC
AGTACACAGG ATTCTCAGGA AATACTTTCA AGTATAATCG GTGTTGTATC GGCGGTGCTT
TTGGTCATTG CAATTATTAC CCTCATAGTC GGAGGTATAG GTATTATAAA CATATTGCTT
GTTTCCGTGA CGGAAAGAAT AAGGGAAATA GGCATCAGGA AGGCATTGGG AGCCCAGAAA
AAGGACATAA TTTTTCAGTT TATCACAGAA TCAATTATTA TGACGGGAAT AAGCGGCAGT
ATAGGAATAT TTCTTGGAGT TTTGGGAGGC AATATAATTT CTCAGGCTAT TCAAATTCCG
CCGGTGATTG ATGTTCCTGT AATTATCGGG GTATTTTTAG GGTCGGTGGT ATTGGGTCTC
GTATTTGGTG TGTATCCTGC AAAGAAAGCG GCTGACCTGG ATCCTATAGA ATCTCTCAGG
TATGAATAG
 
Protein sequence
MSFLESIRQA LDSLKANKLR SILTMLGIVM GVFSIITIMA IGNATEEYIN SQFEKIGANV 
LTVGYKNMNV DSDEMLYLKD IETVKRAAPE IKNVTTYIQH RGTLRIDTKT RSALVYGTTA
QYKDITPMEM AAGRFFTDFD ISSRQKVVVV DEYFAKRYFN RLDIVGEVLQ FKAPSGNYKV
KVIGVAKAIN DAMANLLDNE NFPTQIYMPI TTVQQMYYNN EALDSILVAL DKEVDYKEVG
NRIVRALEMS KGKKDIYMTY STQDSQEILS SIIGVVSAVL LVIAIITLIV GGIGIINILL
VSVTERIREI GIRKALGAQK KDIIFQFITE SIIMTGISGS IGIFLGVLGG NIISQAIQIP
PVIDVPVIIG VFLGSVVLGL VFGVYPAKKA ADLDPIESLR YE