Gene Cthe_1423 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1423 
Symbol 
ID4809084 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1741874 
End bp1742980 
Gene Length1107 bp 
Protein Length368 aa 
Translation table11 
GC content39% 
IMG OID640106846 
Producthypothetical protein 
Protein accessionYP_001037847 
Protein GI125973937 
COG category[R] General function prediction only 
COG ID[COG0628] Predicted permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00001195 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAAGAA ATCTCATAAT TGTTATTACT TATGCAGTGG CTTTGGTTCT TATTGTCATA 
AATTTTGTAC CGATTATGCG CGCTGTATGG AAGTTTATTG TTTTATTCAA ACCGTTTTTT
ATGGGAATTG CCGTTGCTTT TGTCCTAAAC AGGCCATGCA TGGCGGTTGA GAGGTTTTTG
AATAAAAGGT TGTTCAAAAA TCGATTAAAA GTACTTGTCA GAGGAATAGC AATTACTGTT
ACATATTTAG TGGTACTTTT GTTGATTACG CTGATAATAA GTTTTATAAT ACCGGAACTT
ATAAAAAGTA TACAAGTGTT TTTAAGCAAT ATGGGAGCAT ATATAGATAA TTTCAGGGAT
TTGACCAATG AGCTTTCCGA ACTCTTGGGA CTTGAAAGGA TTGACCTGTC GTCTTTGGAC
AAATTGATTC TTGAGTACAC AAACAGATTG GGAAGCAGCT TGACCGAGCT GATGCCGAAA
ATTATCAGCA TTACGACGGG GGTTTTGTCA TTCTTTGCAA CATTGGTAAT AACGGTGGTG
TTCTCGATAT ATATTTTGGC GGGAAAAGAA AGACTTATCG GACAATGCAA AAAAGTTTTC
AGCACTTATC TTCCCGAGTG CCTGTACAAG AAGGGAGCTT ATGTGTATCG TGTTGTGGTG
GATGTGTTTA ACAAATATAT ATATGGACAG CTGGCGGAGG CTTTCATTTT AGGTTCGCTT
TGCTTTATTG GGATGGTTAT TTTTCGGTTT GAATATGCAC TTCTCATAAG CGTTTTAATT
GCAGTTACCG CATTGGTGCC GTATTTTGGA GCGTACATAG GCGGATTTTG TGCGTTCATG
CTCCTTTTAA TGATTTCGCC CACTAAAGCT ATATGGTTTT TAGTTTACCT GGTAGTATTG
CAACAGTTGG AAAATAATTT AATATACCCA AGGGTTGTCG GAAGCAGTCT TGGACTTCCC
GGAATATGGG TTGTTTTGGC GGCAATTGTC GGTGCCGGAG TCGGGGGCCC GATTGGTGTT
TTGACTGGGG TACCGATTGC AACAGTTCTT TTCACTTTGC TTAGAAATGA TGTTTTAAGA
AGATCCGGAA AGCAGAATGT TAAATGA
 
Protein sequence
MTRNLIIVIT YAVALVLIVI NFVPIMRAVW KFIVLFKPFF MGIAVAFVLN RPCMAVERFL 
NKRLFKNRLK VLVRGIAITV TYLVVLLLIT LIISFIIPEL IKSIQVFLSN MGAYIDNFRD
LTNELSELLG LERIDLSSLD KLILEYTNRL GSSLTELMPK IISITTGVLS FFATLVITVV
FSIYILAGKE RLIGQCKKVF STYLPECLYK KGAYVYRVVV DVFNKYIYGQ LAEAFILGSL
CFIGMVIFRF EYALLISVLI AVTALVPYFG AYIGGFCAFM LLLMISPTKA IWFLVYLVVL
QQLENNLIYP RVVGSSLGLP GIWVVLAAIV GAGVGGPIGV LTGVPIATVL FTLLRNDVLR
RSGKQNVK