Gene Cthe_2965 details

Gene Information

Locus tag: Cthe_2965
Symbol:
ID: 4810853
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 3483286
End bp: 3484209
Gene Length: 924 bp
Protein Length: 307 aa
Translation table: 11
GC content: 40%
IMG OID: 640108387
Product: binding-protein-dependent transport systems inner membrane component
Protein accession: YP_001039355
Protein GI: 125975445
COG category: [E] Amino acid transport and metabolism
              [P] Inorganic ion transport and metabolism
COG ID: [COG0601] ABC-type dipeptide/oligopeptide/nickel transport systems, permease components
TIGRFAM ID:
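The coordinate and length fields in this record are linked by simple arithmetic, so they can be cross-checked. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the record.
start_bp = 3483286
end_bp = 3484209

# Assumption: coordinates are 1-based and inclusive, so the span
# covers end - start + 1 bases.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, protein_length_aa)  # 924 307
```

Both values agree with the Gene Length and Protein Length fields reported for this gene.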


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.681707
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGGTTAAAT ACATTTTAAA AAGATTACTT GTATCACTTT TGACATTATG GATAATGTTT 
ACGTTAACTT TTTTCCTTAT GCATTTAACT CCCGGCAACC CGTTTTTGGG GGACGGAAAA
ATTACGACTG AAATACTGGC AAACCTGGAG GCTAAATACG GCCTGGACAA ACCCCTGATG
GTTCAATACA AAATGTATGC AATGAACGTT CTTAAAGGTG ACTTGGGGGA ATCAATAAAA
CTGTCGGGAC AGACTGTCAA TGAAATAATT GCCAGAAAGT TTCCCTATTC ACTAAAGCTG
GGGTTGTTCT CCAGTGCCAT AGCCATAATT ATGGGCACCG TTCTGGGAAC AATAAGCGCG
CTTAAGAAGA ATACCGCCGT CGACAAAATA ATTATGATCA TTGTAACAAT TGGAATTGCG
GTCCCAAGCT TTGTAGTAGC CACAGTAAGT ATGGTTTTAT TCGGGGTAAA GCTGCATCTT
TTACCTACAG TCAGCCTCCT CGACAACTTT TCAAGTTATA TATTGCCCGG TTTTGCGCTG
TCCTTCTTCC CCTTAAGCTT TATCACAAGG CTTATGCGCT CTTCAATGCT TGACGTTATA
AACCAGGATT ATATAAGAAC GGCGCGGGCC AAAGGCTTGT CGGAAACAGT GGTAATTTTC
AAACACGGTT TGAGAAACGG GATTCTTCCG GTGGTAACTT ATGCCGGACC AATGATTGCC
GGAATTGTAA CAGGCTCTTT TGTAATCGAA TCAATTTTCT CAATACCGGG CCTGGGAAGT
TCTTTTGTAA CAAGCATTAC AGCAAAGGAT TACCCCACGG TAATGGGAGT AACTATATTT
TACGGTGCTC TGCTTATTTT TATGAACTTT TTGGTTGACA TAATATATAG ATTTGTCGAC
CCAAGAATTA ATATCACCAA ATAG
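
The gene sequence is displayed in blocks of 10 bases, 60 bases per line, so the spacing has to be removed before any computation. The following is a minimal sketch of that cleanup plus a GC-fraction calculation, using only the first line of the sequence as a short sample; the helper names `clean_sequence` and `gc_fraction` are hypothetical and not part of the record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, used here only as a short sample.
first_line = "ATGGTTAAAT ACATTTTAAA AAGATTACTT GTATCACTTT TGACATTATG GATAATGTTT "
sample = clean_sequence(first_line)

print(len(sample))  # 60
print(round(gc_fraction(sample), 3))
```

Passing all 16 lines of the gene sequence through the same two functions should yield a 924-base string whose GC fraction can be compared with the GC content field of the record.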
 
Protein sequence
MVKYILKRLL VSLLTLWIMF TLTFFLMHLT PGNPFLGDGK ITTEILANLE AKYGLDKPLM 
VQYKMYAMNV LKGDLGESIK LSGQTVNEII ARKFPYSLKL GLFSSAIAII MGTVLGTISA
LKKNTAVDKI IMIIVTIGIA VPSFVVATVS MVLFGVKLHL LPTVSLLDNF SSYILPGFAL
SFFPLSFITR LMRSSMLDVI NQDYIRTARA KGLSETVVIF KHGLRNGILP VVTYAGPMIA
GIVTGSFVIE SIFSIPGLGS SFVTSITAKD YPTVMGVTIF YGALLIFMNF LVDIIYRFVD
PRINITK
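
The record lists translation table 11 for this gene. As a sketch of how the protein sequence relates to the gene sequence, the following translates the first 60 bases of the gene. It relies on the fact that table 11 assigns the same amino acids to codons as the standard code (the two tables differ in which start codons are permitted). The names `CODON_TABLE` and `translate` are illustrative only.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order (first base varies slowest, third fastest).
# Table 11 uses the same codon-to-amino-acid assignments as the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 60 bases of the gene sequence, with the block spacing removed.
sample = "ATGGTTAAATACATTTTAAAAAGATTACTTGTATCACTTTTGACATTATGGATAATGTTT"
print(translate(sample))  # MVKYILKRLLVSLLTLWIMF
```

The output matches the first 20 residues of the protein sequence shown in the record.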