Gene Cthe_1822 details


Gene Information

Locus tag: Cthe_1822
Symbol:
ID: 4809806
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 2156010
End bp: 2156918
Gene Length: 909 bp
Protein Length: 302 aa
Translation table: 11
GC content: 44%
IMG OID: 640107236
Product: inner-membrane translocator
Protein accession: YP_001038236
Protein GI: 125974326
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0559] Branched-chain amino acid ABC-type transport system, permease components
TIGRFAM ID: [TIGR03409] urea ABC transporter, permease protein UrtB
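
The start, end, gene length, and protein length values in this record agree with one another, and the check is simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the last codon of the gene is a stop codon that does not appear in the protein. The function names are hypothetical and do not come from the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(2156010, 2156918)
print(length)                      # 909
print(protein_length_aa(length))   # 302
```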


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.564217
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGATAGTT ATTTACTGCA GATTTTTAAC GGGATAAGCG TAAGCTCGGT GCTTTTACTG 
GCAGCCCTGG GATTGGCAAT AACCTTCGGG CTTATGAGAA TTATCAACAT GGCCCATGGT
GAGATGATAA TGATAGGTGC CTATACCACG TATATGGTGC AAAACTTATT TATATCATTT
TTGGGAGAAA GATATTTCGA TCTGTATTTT ATTGCGGCAA TACCTCTTTC GTTCCTGATT
ACAGGATTTG TCGGTTATTT GCTGGAGATT TCAATTGTAA AACGCCTGTA CGGCAGGACT
TTGGACAGTT TATTGGCCAC ATGGGGAATA AGCCTTATAC TTCAGCAGTT AGCCAGAAAC
ATTTTCGGTG CACCCAATGT AGATGTGAGA AGTCCAAGAT GGCTGAACGG TGGGGTGGTT
GTTTTTGGAA ATATTCAGCT CCCCTACAAA AGGCTTTTTA TAATCCTGCT GGCTGCCCTT
TGCATAGCCG GAGTATATTT ATTCCTCTTT AAAAGCGACA GCGGAAGAAG GATAAGAGCG
GTTATGCAGA ATAGAAGTAT GGCCGAAAGC CTTGGTATAA ACACAAGAAA GGTGGATTCC
ATGACCTTTG CAATTGGGTC GGGATTGGCA GGCATTGCCG GATGTGCCTT GACCCTGTTA
GGCTCAATCG GACCAACCTT AGGAACCAAC TACATTGTGG ATACCTTTAT GGTTGTGGTA
TTGGGCGGTG TCGGAAGAAT TATCGGCACA GTGGCCGGAG CAGCAGCGAT CGGAATCGGA
AATACGACTT TCGAATATTT TACCAACGCC TCTTTGGGAA AGGTACTTGT ATTTCTTGCG
GTTATTCTCT TTCTTCAATG GAAGCCTCAG GGGTTCTTTA GCGTAAGCAG CAGAGTATTG
GATGAGTAA
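
The GC content listed in the Gene Information section can in principle be recomputed from the gene sequence. The following is a minimal sketch, assuming GC content means the fraction of G and C bases over the whole gene; the database may use a different definition or rounding rule. The name `gc_fraction` is hypothetical.

```python
def gc_fraction(seq_block: str) -> float:
    """Fraction of G/C bases in a sequence written in space-separated blocks."""
    # Remove the spaces and line breaks used to lay out the sequence.
    seq = "".join(seq_block.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two 10-base blocks of the gene sequence above.
print(gc_fraction("ATGGATAGTT ATTTACTGCA"))  # 0.3
```

Passing the full gene sequence block to `gc_fraction` and multiplying by 100 gives a percentage to compare with the reported value.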
 
Protein sequence
MDSYLLQIFN GISVSSVLLL AALGLAITFG LMRIINMAHG EMIMIGAYTT YMVQNLFISF 
LGERYFDLYF IAAIPLSFLI TGFVGYLLEI SIVKRLYGRT LDSLLATWGI SLILQQLARN
IFGAPNVDVR SPRWLNGGVV VFGNIQLPYK RLFIILLAAL CIAGVYLFLF KSDSGRRIRA
VMQNRSMAES LGINTRKVDS MTFAIGSGLA GIAGCALTLL GSIGPTLGTN YIVDTFMVVV
LGGVGRIIGT VAGAAAIGIG NTTFEYFTNA SLGKVLVFLA VILFLQWKPQ GFFSVSSRVL
DE
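
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. Below is a minimal sketch of that translation in plain Python. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code and differs only in which codons may act as start codons; since this gene begins with ATG, alternative starts are not handled here. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the usual TCAG ordering. Table 11 shares these
# codon-to-amino-acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq_block: str) -> str:
    """Translate a coding sequence; a single trailing stop codon is dropped."""
    seq = "".join(seq_block.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))
    # Assumption: alternative start codons (e.g. GTG, TTG) are not remapped to M.
    return protein[:-1] if protein.endswith("*") else protein


# First 30 bases and last 9 bases of the gene sequence above.
print(translate("ATGGATAGTT ATTTACTGCA GATTTTTAAC"))  # MDSYLLQIFN
print(translate("GATGAGTAA"))                         # DE
```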