Gene Cthe_1821 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1821 
Symbol 
ID4809805 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2154850 
End bp2155998 
Gene Length1149 bp 
Protein Length382 aa 
Translation table11 
GC content43% 
IMG OID640107235 
Productinner-membrane translocator 
Protein accessionYP_001038235 
Protein GI125974325 
COG category[E] Amino acid transport and metabolism 
COG ID[COG4177] ABC-type branched-chain amino acid transport system, permease component 
TIGRFAM ID[TIGR03408] urea ABC transporter, permease protein UrtC 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAAGGA ATTTGCGTTA TTTAAAAGGC AGTATTTTTA AAGGCAGGAT ATCTGCAAAT 
GACATTATTA TAACGTTATT ATTTGTTGTG TTGGCTTTAG CCCCGTTGTT TCTTTCGGAT
TTCAGGACCA ATCTTTTGGG AAAGTTTATT GCTTATGCCA TTCTGGCTTT AGGAATCGAT
CTCATATGGG GCTATACCGG AATATTAAGT TTGGGGCATG GAGTCTATTT TGGACTTGGT
GCGTATTGCA TGGCAATGTA CTTGAAACTT GAGGCGAGCA ACGGAAAACT TCCGGATTTT
ATGTCCTGGA GCGGGCAAAA TGTTTTGCCG TGGTTTTGGA AGCCCTTCGC TTACGCACCG
GTGGCAATTA TTCTTTCCGT ATTGGTGCCC GCAGTCCTTG CGTTAATAAT CGGGTATCTC
ACTTTCAAAA ACAGGATTAA AGGTGTTTAC TTTTCCATAC TGACGCAGGC CCTTTCCATA
ATATTCGTGG TATTGTTTGT GGGGCAGCAG GCTTATACGG GAGGAACCAA CGGTATAACC
AATTTCAAGA CCATCTTTGG TTTCCCGCTG TCCGGTTTCT CCACAAAGGT GACTCTTTAT
TATGTTGCAT TGGGATTTCT GATACTGGCC TTTCTGTTTT GCCGGTGGAT TGTGCAAAGC
CGGCTTGGAA AAGTGTTGAT TGCCATAAGG GACAGCGAAA ACCGGGCAAG ATTTTCAGGA
TACAATCCGG CAATATACAA AACCTTTGTT TACTGTATTT CTGCCGGACT GGCCGGATTG
GCAGGAGCTT TATTCGTTCC TCAGGTGGGA ATTATTTCAC CGGCAGAGAT GGGAATAGTC
CCGTCGGTGG AAATGGTTAT ATGGGTTGCA ATCGGAGGAA GAGGCACTTT AGTCGGATCT
GTCATCGGGG CTATACTGGT AAACTCTTTG AAGAGTATGG TAAGTGAGAG CTTTCCGGCA
GTCTGGTCCT ATTTTATAGG GATTTCCTTT ATTGCTGTGG TCATATTTAT GCCTTACGGT
CTGGCAGGGT TGTTAAATCA GATTAAAGGA AAAATATATG CTCAAAAAGC TCAAAAAAGT
GTAAAAAGGT ATTCTTCTGC CACTTTAAAT ATTCTTGAAG AATCAGGGTG TGATGAATAT
GTCGGATAG
 
Protein sequence
MERNLRYLKG SIFKGRISAN DIIITLLFVV LALAPLFLSD FRTNLLGKFI AYAILALGID 
LIWGYTGILS LGHGVYFGLG AYCMAMYLKL EASNGKLPDF MSWSGQNVLP WFWKPFAYAP
VAIILSVLVP AVLALIIGYL TFKNRIKGVY FSILTQALSI IFVVLFVGQQ AYTGGTNGIT
NFKTIFGFPL SGFSTKVTLY YVALGFLILA FLFCRWIVQS RLGKVLIAIR DSENRARFSG
YNPAIYKTFV YCISAGLAGL AGALFVPQVG IISPAEMGIV PSVEMVIWVA IGGRGTLVGS
VIGAILVNSL KSMVSESFPA VWSYFIGISF IAVVIFMPYG LAGLLNQIKG KIYAQKAQKS
VKRYSSATLN ILEESGCDEY VG