Gene Cthe_1580 details


Gene Information

Locus tag: Cthe_1580
Symbol:
ID: 4809571
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 1910020
End bp: 1911138
Gene Length: 1119 bp
Protein Length: 372 aa
Translation table: 11
GC content: 47%
IMG OID: 640106998
Product: inner-membrane translocator
Protein accession: YP_001037999
Protein GI: 125974089
COG category: [R] General function prediction only
COG ID: [COG4603] ABC-type uncharacterized transport system, permease component
TIGRFAM ID:
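
The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes that the Start bp and End bp values are 1-based inclusive coordinates and that the gene length includes the terminal stop codon; the function names are illustrative and are not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 1910020
END_BP = 1911138
REPORTED_GENE_LENGTH = 1119      # bp
REPORTED_PROTEIN_LENGTH = 372    # aa


def gene_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # subtract the stop codon


if __name__ == "__main__":
    print(gene_length(START_BP, END_BP))                   # 1119
    print(protein_length(gene_length(START_BP, END_BP)))   # 372
```

With these assumptions, 1911138 - 1910020 + 1 gives 1119 bp, and 1119 / 3 - 1 gives 372 aa, matching the reported values.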


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.0212662
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAACACA AAAACAACCA GAATAACAAG CCCTCTCAGA ATATTTTGAA AATTAAAGCA 
CAAAGACTAA TTCATCAAAA TGCCCTATAT ACCATTATTG CTATTTTCTT CGGATTTCTG
GTCGGCGGCA TTTTCCTGTG GGCAGCCGGC TTCAGTCCCA TTGAAGCATA TGCAAAGCTG
TTCAGCAGCG TGTTCAGCCG TCCCAAATAT CTCATTTGGT CGGTGATCTA TGCAACTCCG
CTGATATTTA CGGGGTTAAG CGTGGCATTT TCTTACAAAA TGGGTGTTTT CAACATCGGC
GCCGAAGGGC AGTTTGTTGT AGGCTCACTG GCTGCTCTGT GCGTAGGCAT CCTGGTGGAC
GCTCCTCCCG GGTTGCACGT GCTGCTATGT ATGCTGGCCG CTGTTGCCGC CGGCATGCTG
TGGAGCTTTC TGGTGGCAGT GCTGCGCGTC CGGTTCGGTA TCAATGAAGT TTTGTCCTTC
ATTATGTTCA ACTGGATCGC CTTTTATTTT TCAAATTATG TAGTCAATAC TGCAGCTATT
CACAAGGTTG GCGGCGGAGA GGCTTCCAAG GATATTCGCG AATCTGCAAG AATTCTGCTG
CCCCAATCGT TGCAGAATAT TTTTCAGAGT AATAAAGCCA ACTACGGTAT TTTCCTGGCA
ATTATTGCGG CAATTGTTAT TTGGTTTATC CTGACGAAAA CCACTCTTGG CTATAAAGTA
CAGGCCGTTG GTCTCAATCC TCACGCAGCC AAATACGGCG GTATTAATTC CAACAAGACG
ATGTATATTG CCATGAGCCT GTCGGGTGCT CTGGCAGCTT TGGGCGGTGC CGTGCAACTG
ATGGGTAACT CCATGCGCAT CAGTCAGTTT GCCGGACAGG AAGGCTTCGG CTTCCAGGGA
ATCACCGTTG CGTTAATCGC AAGCTCTCAC CCTATTGGTT GTATTTTTTC AGGGCTGTTT
TACGGTGCCA TGAAATACGG CGGCTCCAAG CTCAATCTAA TTGATGCTCC CACAGAAGTC
GTTGACATCA TTATGGGCAC CATCGTGCTC TTTATCGCCA TATCACATGT ATTCCGCTAT
TTAATTACAA GACGGCTTAA GAATAAGGAG GACAAATAA
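
The reported GC content can be recomputed from the gene sequence. The following is a minimal sketch; `gc_content` is a hypothetical helper, and it assumes the sequence is pasted as shown here, in space-separated blocks of ten bases, so whitespace is stripped before counting.

```python
def gc_content(sequence: str) -> float:
    """Return the percentage of G and C bases in a DNA sequence.

    Whitespace (spaces, newlines) is ignored so that sequences formatted
    in blocks of ten bases can be passed in unchanged.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


if __name__ == "__main__":
    # First twenty bases of the gene sequence above, used only as a demo input.
    print(gc_content("ATGAAACACA AAAACAACCA"))
```

Passing the full 1119 bp sequence to the same function gives the gene-wide figure to compare against the rounded percentage in the Gene Information fields.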
 
Protein sequence
MKHKNNQNNK PSQNILKIKA QRLIHQNALY TIIAIFFGFL VGGIFLWAAG FSPIEAYAKL 
FSSVFSRPKY LIWSVIYATP LIFTGLSVAF SYKMGVFNIG AEGQFVVGSL AALCVGILVD
APPGLHVLLC MLAAVAAGML WSFLVAVLRV RFGINEVLSF IMFNWIAFYF SNYVVNTAAI
HKVGGGEASK DIRESARILL PQSLQNIFQS NKANYGIFLA IIAAIVIWFI LTKTTLGYKV
QAVGLNPHAA KYGGINSNKT MYIAMSLSGA LAALGGAVQL MGNSMRISQF AGQEGFGFQG
ITVALIASSH PIGCIFSGLF YGAMKYGGSK LNLIDAPTEV VDIIMGTIVL FIAISHVFRY
LITRRLKNKE DK
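
The protein sequence corresponds to a translation of the gene sequence with translation table 11. The sketch below builds the codon table by hand rather than relying on a library; it assumes the amino-acid assignments of table 11 match the standard code (the tables differ in permitted start codons, which does not matter here because this gene starts with ATG). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon.

    Whitespace is ignored so block-formatted sequences can be pasted directly.
    """
    bases = "".join(dna.split()).upper()
    if len(bases) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(bases), 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First and last lines of the gene sequence; both begin in frame
    # because every full line holds 60 bases.
    print(translate("ATGAAACACA AAAACAACCA GAATAACAAG CCCTCTCAGA ATATTTTGAA AATTAAAGCA"))
    print(translate("TTAATTACAA GACGGCTTAA GAATAAGGAG GACAAATAA"))
```

The two demo inputs yield MKHKNNQNNKPSQNILKIKA and LITRRLKNKEDK, which match the first and last lines of the protein sequence above once the spacing is removed.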