Gene Cthe_2802 details


Gene Information

Locus tag: Cthe_2802
Symbol:
ID: 4809639
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 3303221
End bp: 3304204
Gene Length: 984 bp
Protein Length: 327 aa
Translation table: 11
GC content: 41%
IMG OID: 640108222
Product: NLPA lipoprotein
Protein accession: YP_001039194
Protein GI: 125975284
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components
TIGRFAM ID: [TIGR01728] ABC transporter, substrate-binding protein, aliphatic sulfonates family
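
The length fields in this record can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the database.

```python
# Coordinates copied from the record above.
start_bp = 3303221
end_bp = 3304204

# Assumption: both coordinates are 1-based and inclusive.
gene_length_bp = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
codon_count, remainder = divmod(gene_length_bp, 3)

# Assumption: the last codon is a stop codon and is not translated.
protein_length_aa = codon_count - 1

print(gene_length_bp, remainder, protein_length_aa)  # prints: 984 0 327
```

Under these assumptions the result agrees with the listed gene length of 984 bp and protein length of 327 aa.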


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000138117
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAGAA AAGCGGCAAT TTTCATATTA CTGGTTTTGA TTATATTAAG CTTGGCAGGC 
TGCAGTAATG GAGACAGGGC GGCCGGAACG GCAAATAACG GGACAAATGC CAATACGGAA
GTCAAAACGG TAAAAATTGC TTATCTGCCT ATTACCCATG CTCTTCCGCT TTATGTGGAA
AATGAACTTG CAAATGAAAA CTTTAAAAAT TTTAAACTGG AGCTTGTAAA GTTTGGTTCG
TGGACGGAAC TGGTGGATGC TTTGAATTCA GGAAAAGTGG ACGGTGCGTC CATGCTTATA
GAACTTGCAA TGAAAGCAAA GGAGCAGGGG ATTGATTTAA AAGCGGTTGC CTTGGGTCAC
AGAGACGGAA ATGTGGTGGT GGTATCCAAG GATATCAATA AAGTTGAAGA TTTGAAAGGA
AAAAGCTTTG CCATACCAAG CAAGCTTTCA ACTCATAATA TTCTCTTACA TATTATGCTG
AAAAACCATG GCCTTGCATA TAACGATGTA AATGTTGTTG AGCTTCCACC GCCGGAAATG
GCGGCCGCTC TTGCGGAAGG CAGGATATCC GGCTATTGTG TGGCTGAGCC TTTTGGAGCA
AAATCGGTGG CAGTGGATAA AGGTAAGACC TTGTTTGAGT CCCAGGATTT GTGGGAAGGT
TCTGTGTGCT GCGGATTGGT TCTTAGAAAT GATTTTATCA AAAATAACGA GGCTATAGCG
GAGGAATTTA TCAAAGAATA CATAAAAGCA GGGGAAAAAG CTGAAGCAAA AGATGAGACA
ATCCGGGATA TTGCCACAAA ATATCTAAAA GCGGAGGAAC AAGTGCTGGA TTTGTCTCTT
AAATGGATTT CCTATGAAAA CTTGAAACTT GAAGAAAAGG ATTACAATGA GCTTGCAGAA
TACATGGTGG AAATGGGACT TTCCGAAAAT CCTCCGAAGT ACGACGAGTT TGTGGATAAT
ACATTTATAG GTAAAGTGAA GTGA
 
Protein sequence
MKRKAAIFIL LVLIILSLAG CSNGDRAAGT ANNGTNANTE VKTVKIAYLP ITHALPLYVE 
NELANENFKN FKLELVKFGS WTELVDALNS GKVDGASMLI ELAMKAKEQG IDLKAVALGH
RDGNVVVVSK DINKVEDLKG KSFAIPSKLS THNILLHIML KNHGLAYNDV NVVELPPPEM
AAALAEGRIS GYCVAEPFGA KSVAVDKGKT LFESQDLWEG SVCCGLVLRN DFIKNNEAIA
EEFIKEYIKA GEKAEAKDET IRDIATKYLK AEEQVLDLSL KWISYENLKL EEKDYNELAE
YMVEMGLSEN PPKYDEFVDN TFIGKVK
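
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is only a sketch: the helper names (`clean`, `gc_percent`, `translate`) are hypothetical, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two tables differ in their permitted start codons). Only the first and last rows of the gene sequence are used here; the full sequence can be pasted in the same way.

```python
# Standard codon assignments (shared by translation tables 1 and 11),
# listed in the conventional T, C, A, G order for each codon position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Return the percentage of bases that are G or C."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last rows of the gene sequence shown above; both start in frame.
first_row = "ATGAAAAGAA AAGCGGCAAT TTTCATATTA CTGGTTTTGA TTATATTAAG CTTGGCAGGC"
last_row = "ACATTTATAG GTAAAGTGAA GTGA"

print(translate(first_row))  # MKRKAAIFILLVLIILSLAG
print(translate(last_row))   # TFIGKVK
```

The two outputs match the first 20 and last 7 residues of the protein sequence above. Calling `gc_percent` on the full gene sequence gives a value to compare with the GC content field.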