Gene Cthe_2446 details

Gene Information

Locus tag: Cthe_2446
Symbol:
ID: 4809825
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 2916929
End bp: 2918047
Gene Length: 1119 bp
Protein Length: 372 aa
Translation table: 11
GC content: 44%
IMG OID: 640107860
Product: ABC-type sugar transport system periplasmic component-like protein
Protein accession: YP_001038841
Protein GI: 125974931
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1879] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
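
The coordinates and lengths listed above are mutually consistent if the start and end positions are read as 1-based and inclusive, and if the final codon is an untranslated stop. A minimal Python sketch of that arithmetic follows; the variable names are illustrative and both conventions are assumptions, not something the record states.

```python
# Values copied from the Gene Information fields above.
start_bp = 2916929
end_bp = 2918047

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1119 372
```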


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.0237495
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTAAAGT TAAAAAAAGT AATAGCTCTT CTTGTTACAA CAATGCTTGT TTTATCCGTG 
TTAGTTGGAT GTGGAAACAA TACAACCAAC AATAACAGCG GCACAAGCGG ACCAAGTACA
AACAGCGGAA CAAGCGGATC GGGCACAAAC AGTGGTACTA CGTCGAACAA TGTAACCGGA
AAAACCGATT TGGCTGACAC CAATTTTGAC ACAAGCTATA CGCCAAAGAG AACATCATAC
AAGATTTACT GCACATACAA GAATATTCAT GCCTGGTATG ATGCTATCAA ATGCGGTATT
GATGCTGCTG TAAAAGAGCT GGCAGAAAAA GGCGTTACAG TAGATTATGA ATGGTATGGA
CCTGCCCAGC CGGATGCCGT TGACCAGGTA AATTCCATTG AGACTGCAAT CGGACAAGGT
TGGGACCTTA TCGCTGTCGA CGTTAACCAG CCCGAATTGA CAGGAGAAGC AATCAACAAT
GCCGTCGCAA AAGGTATTCC TGTTGCTGTA TTCGGAACTT CAGACGTACC GAACTGTGAC
CGTGCATTCT TTGTAGGAAA TACTGACCCG TATGGTGATG GCTGTGCCCT TGCAAAAGCA
GTTTGTGAAA AGATGGGTGG CAAAGGTCAG ATTGCAATTC TGGCAGGTAC TATAGGAGCT
TTGGCTCACG AAGAAAGATT GCGTGGATTT AAGGATACAA TTGCAAAATA TCCTGATATA
GAAATCGTTG ACGAGCAGCG CGACAACGAC GAAGTTGAAA AGGCAATCAG TATTACAGAA
TCCTGGCTCC AGGCTTATCC TAACCTGGGA GGTATTCTCT GCAACAACAT GTCCAACCCG
GTTGGTGCAT GCCAGGCTGT AGCAGATGCC GGTAAATCAG GCAAGATCGT TATCGGCGGT
ATGGACCATG ACCTTCGTGC TTTGAATGCT CTGAAAGATG GTACTTTGTA TGTGGCACAA
GTTCAGAACT GCTATGACAT GGGTTACAAA CTTATCTACA ATGCAATAAA GACGATTGAC
GGTGAAAAAG TTGAAGAGTC AACAGCAGTA GGTTCCACTT CAGTGTATGC ACAAGACGCA
GATAAATTCA TCAATATGTT ATATGGAGAG GCAAATTAA
 
Protein sequence
MLKLKKVIAL LVTTMLVLSV LVGCGNNTTN NNSGTSGPST NSGTSGSGTN SGTTSNNVTG 
KTDLADTNFD TSYTPKRTSY KIYCTYKNIH AWYDAIKCGI DAAVKELAEK GVTVDYEWYG
PAQPDAVDQV NSIETAIGQG WDLIAVDVNQ PELTGEAINN AVAKGIPVAV FGTSDVPNCD
RAFFVGNTDP YGDGCALAKA VCEKMGGKGQ IAILAGTIGA LAHEERLRGF KDTIAKYPDI
EIVDEQRDND EVEKAISITE SWLQAYPNLG GILCNNMSNP VGACQAVADA GKSGKIVIGG
MDHDLRALNA LKDGTLYVAQ VQNCYDMGYK LIYNAIKTID GEKVEESTAV GSTSVYAQDA
DKFINMLYGE AN
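
The protein sequence can be checked against the gene sequence by translating codon by codon. The sketch below is one possible way to do that in plain Python, not the method the database used. It assumes the listed gene sequence is already in the coding orientation and in frame from its first base. The codon table is the standard one, whose amino acid assignments translation table 11 shares (table 11 differs in its permitted start codons, which does not matter here because the gene starts with ATG). The names `CODON_TABLE` and `translate` are hypothetical helpers.

```python
from itertools import product

# Standard codon-to-amino-acid assignments in TCAG order.
# Assumption: these also apply to translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna):
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)

# First and last lines of the gene sequence above. The last line is in frame
# because the 18 preceding lines hold 60 bases each (1080, a multiple of 3).
first_line = "ATGTTAAAGT TAAAAAAAGT AATAGCTCTT CTTGTTACAA CAATGCTTGT TTTATCCGTG"
last_line = "GATAAATTCA TCAATATGTT ATATGGAGAG GCAAATTAA"

print(translate(first_line))  # MLKLKKVIALLVTTMLVLSV
print(translate(last_line))   # DKFINMLYGEAN
```

Both outputs match the start and end of the protein sequence listed above. Passing the full gene sequence to `translate` would extend the same check to all 372 residues.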