Gene Cthe_1562 details


Gene Information

Locus tag: Cthe_1562
Symbol:
ID: 4810069
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 1890324
End bp: 1891535
Gene Length: 1212 bp
Protein Length: 403 aa
Translation table: 11
GC content: 41%
IMG OID: 640106980
Product: hypothetical protein
Protein accession: YP_001037981
Protein GI: 125974071
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG4591] ABC-type transport system, involved in lipoprotein release, permease component
TIGRFAM ID:
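
The length fields in this record can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1890324
end_bp = 1891535

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1212
print(protein_length)  # 403
```

Under these assumptions the computed values agree with the listed Gene Length (1212 bp) and Protein Length (403 aa).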


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00597787
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATGGCAT ATGCATTAAC TTCGAATAGA ATCTCCTTTC TAAATCTCAG AAAAAAGTCC 
TTTCGCACCT TTGCATTAAT ATTGGTGGTT GCAATCCTGT CTTTTGCACT TTTTGGAGGA
ACTGTCCTGT CCTTTAGCTT TAGAAACGGG CTTCGCAGTG TAGAAGCAAG GCTTGGTGCA
GATTTGATTG TGGTTCCTCT CGGGCATGGG AAACAACAGG AATCCATTCT TTTAACCGGG
GAGCCGTCCT ACTTTTATTT TAATAAAGAG ATTGTCCAAA AGCTTCAGGA AGTTGAGGGT
ATTTCCAAAC TGTCGACACA ATTTTACCTC ATTTCACTAA GTACAGGATG CTGTTCTCTT
CCTGTGCAGA TTGTGGGATT TGATCCGGCT ACGGATTTTT CAATTCAACC GTGGATTCAA
GAAACTCTGG GGGGAAATCT TGAAAACGGT GCTGTTATTG TCGGAAGTGA TATTATGATT
GAGGATAACA AGCACATTAA ATTTTTTGAT AAAGAATATC CCGTGGCGGC AAAGCTTGAC
AAAACGGGGA CAGGATTGGA TCAATCCGTA TTTGCTACTA TGGAAACCCT TAAAGACCTA
TATTCCGGTG CAAAAGAAAA AGGCTTTAAC TTTTTGGAGG ATACGGATCC TGATACCTTT
ATTTCATCCG TACTGATTAA AGTTCGTGAA GGGTACAATA TAGATCAAGT CATAACCAAT
ATCCGAAGAA AAACGGACGG TGTCCAAATT GTCAGGACAC AAAATTTAAT TACCGGCATT
TCAAAGAGCC TTGGCAATAT TATCACCTTT TTTAATGTAT TTGCATTGGT GCTTTTGGGC
GTAACCTTGG TAATTTTGAC GGTGGTATTT TCAGCCTCAG CCAATGAACG AAAAAAGGAA
TTTGCTATTA TGAGGATACT GGGGGCCACA AGGAAAAAGC TTGCTGCCGT TTTGATTTGG
GAGTCGCTGT ATATCAGTGT GTCAGGCGGT GCTATAGGAA CTATACTTGC GGCAATATTT
GTATTTCCCT TTAACGTTTT CATTGGTGAC AGCATAGGAT TGCCTTACAT ACAGCCTTCT
CTTCTTTGGA TTATAGCTAT TTTACTGGGA ACATTACTCG TGGCTTTTTC CCTTGGCCCA
ATTGCTTCAG CCTATTCTGC CGTTAAAGTC AGCCGTTCCC AAACTTATCT GACATTAAGG
GAGGGTGAGT AG
 
Protein sequence
MMAYALTSNR ISFLNLRKKS FRTFALILVV AILSFALFGG TVLSFSFRNG LRSVEARLGA 
DLIVVPLGHG KQQESILLTG EPSYFYFNKE IVQKLQEVEG ISKLSTQFYL ISLSTGCCSL
PVQIVGFDPA TDFSIQPWIQ ETLGGNLENG AVIVGSDIMI EDNKHIKFFD KEYPVAAKLD
KTGTGLDQSV FATMETLKDL YSGAKEKGFN FLEDTDPDTF ISSVLIKVRE GYNIDQVITN
IRRKTDGVQI VRTQNLITGI SKSLGNIITF FNVFALVLLG VTLVILTVVF SASANERKKE
FAIMRILGAT RKKLAAVLIW ESLYISVSGG AIGTILAAIF VFPFNVFIGD SIGLPYIQPS
LLWIIAILLG TLLVAFSLGP IASAYSAVKV SRSQTYLTLR EGE
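
The relationship between the two sequences above can be illustrated by translating short fragments of the gene sequence. This is a minimal sketch, not the tool the database used: `join_blocks` and `translate` are hypothetical helper names, and the codon table is written out by hand as the standard amino acid assignments, which translation table 11 shares for internal codons (the tables differ in which alternative start codons are permitted). Only the first 30 and the last 12 nucleotides of the gene sequence are used.

```python
# Standard codon-to-amino-acid assignments, with bases in the order T, C, A, G.
# Assumption: table 11 uses these same assignments for non-initiator codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def join_blocks(text):
    """Remove the spaces and line breaks that split the sequence into blocks of 10."""
    return "".join(text.split()).upper()


def translate(dna):
    """Translate complete codons in reading frame 1; '*' marks a stop codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[pos:pos + 3]] for pos in range(0, usable, 3))


# First 30 and last 12 nucleotides of the gene sequence in this record.
first_30 = join_blocks("ATGATGGCAT ATGCATTAAC TTCGAATAGA")
last_12 = join_blocks("GAGGGTGAGT AG")

print(translate(first_30))  # MMAYALTSNR
print(translate(last_12))   # EGE*
```

The first fragment reproduces the opening block of the protein sequence (MMAYALTSNR), and the last fragment gives the final residues EGE followed by the TAG stop codon.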