Gene Cthe_1932 details


Gene Information

Locus tag: Cthe_1932
Symbol:
ID: 4810790
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 2305451
End bp: 2308456
Gene Length: 3006 bp
Protein Length: 1001 aa
Translation table: 11
GC content: 42%
IMG OID: 640107348
Product: S-layer-like domain-containing protein
Protein accession: YP_001038343
Protein GI: 125974433
COG category:
COG ID:
TIGRFAM ID: [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type
            [TIGR01563] phage head-tail adaptor, putative, SPP1 family
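
The start and end coordinates, gene length, and protein length listed above agree with one another. The short sketch below checks this, assuming that the coordinates are 1-based and inclusive and that the gene length includes the terminal stop codon. The variable names are illustrative and are not part of the database record.

```python
# Values copied from the gene record above.
start_bp = 2305451
end_bp = 2308456

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS length includes the stop codon,
# which does not contribute an amino acid.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, protein_length_aa)  # 3006 1001
```

Under these assumptions the computed values match the listed 3006 bp and 1001 aa.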


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.446643
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAAAAAA AGTTTTCAAA TAAACCTGGA ATCAGAATTT GTTTTGTGAT GGTATTTTCA 
TTTGTACTGT CTTTTGCCTT GAATATTCCC GTACATGCCG ATGAAGAGCC GGAAAAGATA
TATTACGGAT TGAAGAATGC TTCCACAATT CTTAACAATA TAAATTTTAC CGATGTCAGA
AACTCTGCAA CATGGTCAAA GGAGGCAATT TGCGAGGCTG CGGCTTTGGA TATTGTTAAA
GGCTACGGCA ACAGGGTGTT TGGGCGCACA AACAATGTGA CAAAGGAAGA GGCAATTGCC
ATCATATACA GGGCGGCAGG AAGGGAAAAG GATGCACAAT TGGCGGCGGA GGCCCTTGAG
ACTGCTAGAA ATGCCGAAGA CAGAAGCAGT TATGCCCCAA GCATGTGGTC CGACGGGTAT
CTGCAGCTTG CAGCCGGTGA GGGCCTGATT AGCAGCCAGG ATCTTCAGGA TGCTTTCACC
GCGGATCAAA GCAGCCTTGG GACAGGAGCA TTTTTAAGAA GGGCGCCGGC CCAAAGGCAG
GAAGTGGCCT ATTGGATTTC AAAAGTTTTC GGCATTGAAC CTGTCTACGG TCAGCAGAAA
ATATTCAACA GTTTTCGGGA CTGGTCGAGC GCGGATCCTT TGAAAATTCC GTACATAGAA
GCAGTTCTTG TGGAAAACAT AATGAACGGT GAGGGGAATG GTTATTTCAG ACCCACGGGT
TTTGTAACCA GGGAACAGAT TGCGCAAATA ATAAAAAATG CGGATAAAAG AGTTCTTCCG
CTTTTAGGCT ATGAAAAGAA AATGGGTACG GTGGAAGACA TCCGCAAGGA ATACGATTTT
ACGGGAAATA CTCATGTATA TACCAATACT TTCCATATAA GAAGCAGCAA CGGCAAACTT
CACAAGATTG TGACCGAGTT TTACGACAGC AGCAGGAGCG GATATAGAAA TGAGCAGGGC
GGAAGCAGCA TAAAACCGTC CGACACCGAT CTTATTGTTT ACAGGGACGG ATGGATAGGC
ACAAGCTCCC TTTTGAAAGT CGGAGACAGA GTAGAGTATA TAACGGCAGC GGACGGCACA
GTCAGATATG TGCGGGTGAT ATCCGCCACG TCGGATACGG AATACATTGT TGCAAAAATA
AAAGAAGTAA ATTATGACAC TTCCACCATA AAGGCGGCAA TAGGGTTTTA CCTTGAGTAC
CCGGACATTG ACATGGACAG TATAAGTTTT GATTTTAGAA ACAGCGACGG GACAGTTGAC
GCAACACTTG TATACAGCAA TATTGTAAAA GTGTTTATAG AGGGAAGGAT AGCAGGTATA
AGGGATATTG AACCCGGAAC AGATGCAATT CTTACTGTAA AGGACAATAT TATAACTACA
ATTACGACCC TTGACTTAAG ACTGAAAGAG CAGGGTGTGA TAAGTGGTAT TGTTGAGGAA
AACAATCCCC ACCTGGGCTA TATAAGTTTA TACAGTGATA ACGGTTTACG GGTGGACGAT
TCCCTTAACA AAATAAAAAT ATATAACTAT TCCAATCCGG AGGAAGTAAC GGTATATAAA
AATCACGTCC CTGCAAAACT TGAAGATGTC GAAAGCGGTG ATTCGGTATA CATTAAACTT
GACCAAAACG GAGCAATAGA AATGATAAGT GCCGTGGACA ACTATGTGGT AAAGTACGGG
AAAATAATAT CTGTAAGGCC GGGGCTTATA AATGTTGCCT ATGATGACGG CACGCAGCAA
ATTCTTAGTG TTGACGAAAA CGTTCCGATT GTGCAGGACA ACAAACTTAT GGATTTTGAT
GCCCTGAAGG ATGGAGACAG GGTGAAACTG CTTTTAAATA TCACAGACAA GGCTACAACC
CTGAAAAAGC TTACCGTAGA AGGAGACGAG CATTTTATTG CAAATATCTA CCGGGGTATA
GTGGCATCGG TGGATGATAT TTCGAAAAAG CTGTTGCTTC AGCACCTTGA AGTGTTTAAC
AACGGAGAAT GGGAAAGAAC GGAACAGAAA GGTTTTACCT CATTGAAAAT GTCTGAGGAT
AATAAATTTG TTTACAATGA CGATATTGTG GACATGCGCC GGGCAAAGGA GCTGATTGCG
GGAAGCACAG CGTATATTGC AGTTGAGAAA GACTACGGCG GAATTGAAAA GGCGGTTTTT
GTATCCGTTA AAAACTCCAA AGACAGTGAG GCTCCGGTGC TTAACGACAG CATCATGTCA
GCCACTTTCG GAAATAAAGG GGAATTCACG CTGAAAAAAG AATATCTGAA CATTTCTTAC
GGAAACGGGA CAATCATTGT AAAGGACAAC AGGCTGGTTA CCGGAAACAG CATAAATTCG
GAGGACAGGG CATATGTTGT TGCAAGCAGA AGCTATGACG ACGGAAAATA TTATGCTTTT
ATAGTCAAGA TAGATGACAA GACGGATTCG AACTTCTTTA CCATATATCG CGGAAGGATA
GCAAACATAA ATGAGAACAA AGACTTTACT TTGGAATCTT ATTCAGAGCT TAAAGGAGTT
GAATGGAACT ACTACAACAC CCCCAGAACC TTTAGAATCA CATATGACAC GCAAATTCTA
GGTGATGACG GAGTTGTAGG ACAGAGGGAC TTTACCGACT ATGGTGATTC AAGCTATAAG
AGCAGGACGG TATATGTATT GTCACACGGT GCCGATGCTG TGTTGATTAG TACTGCTCCT
TACGGTAACA TTAACGTAAA AGGTAAAATT TATGAACTTA TATCAGAAAA TGCCGATGGT
TCAGAGGCTC AGGAAGGACA GCAAGAACCG GTGGGATTTA AACTCCAGAA CTCAAAAGTT
TACGATTTGC AATCGCATAT GTGGGTGGAC GGTAAAGACA TGGATATTAA CCTTCTTAAA
AACAGCATTA TACTTAAGGA CAACAAAATT ATAAAACCGT CCGAATTAAA GAAAGGCGAC
AGTGTAAGGC TTATCAAAAA AGATGACGAA CAGGCTGGAG ATGCCTATAT TATATTCGTT
GAATAA
 
Protein sequence
MEKKFSNKPG IRICFVMVFS FVLSFALNIP VHADEEPEKI YYGLKNASTI LNNINFTDVR 
NSATWSKEAI CEAAALDIVK GYGNRVFGRT NNVTKEEAIA IIYRAAGREK DAQLAAEALE
TARNAEDRSS YAPSMWSDGY LQLAAGEGLI SSQDLQDAFT ADQSSLGTGA FLRRAPAQRQ
EVAYWISKVF GIEPVYGQQK IFNSFRDWSS ADPLKIPYIE AVLVENIMNG EGNGYFRPTG
FVTREQIAQI IKNADKRVLP LLGYEKKMGT VEDIRKEYDF TGNTHVYTNT FHIRSSNGKL
HKIVTEFYDS SRSGYRNEQG GSSIKPSDTD LIVYRDGWIG TSSLLKVGDR VEYITAADGT
VRYVRVISAT SDTEYIVAKI KEVNYDTSTI KAAIGFYLEY PDIDMDSISF DFRNSDGTVD
ATLVYSNIVK VFIEGRIAGI RDIEPGTDAI LTVKDNIITT ITTLDLRLKE QGVISGIVEE
NNPHLGYISL YSDNGLRVDD SLNKIKIYNY SNPEEVTVYK NHVPAKLEDV ESGDSVYIKL
DQNGAIEMIS AVDNYVVKYG KIISVRPGLI NVAYDDGTQQ ILSVDENVPI VQDNKLMDFD
ALKDGDRVKL LLNITDKATT LKKLTVEGDE HFIANIYRGI VASVDDISKK LLLQHLEVFN
NGEWERTEQK GFTSLKMSED NKFVYNDDIV DMRRAKELIA GSTAYIAVEK DYGGIEKAVF
VSVKNSKDSE APVLNDSIMS ATFGNKGEFT LKKEYLNISY GNGTIIVKDN RLVTGNSINS
EDRAYVVASR SYDDGKYYAF IVKIDDKTDS NFFTIYRGRI ANINENKDFT LESYSELKGV
EWNYYNTPRT FRITYDTQIL GDDGVVGQRD FTDYGDSSYK SRTVYVLSHG ADAVLISTAP
YGNINVKGKI YELISENADG SEAQEGQQEP VGFKLQNSKV YDLQSHMWVD GKDMDINLLK
NSIILKDNKI IKPSELKKGD SVRLIKKDDE QAGDAYIIFV E
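
The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is a minimal illustration and not the database's own pipeline: the helper names `clean`, `gc_percent`, and `translate` are hypothetical. It relies on the fact that translation table 11 assigns amino acids to codons exactly as the standard code does and differs only in the set of permitted start codons.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order (bases ordered T, C, A, G; third base varies fastest).
# Table 11 shares these assignments with the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for 10-letter grouping."""
    return "".join(seq.split()).upper()


def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate complete codons and stop at the first stop codon."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    protein = "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))
    return protein.split("*")[0]


# First 30 bases of the gene sequence listed above.
first_30 = "ATGGAAAAAA AGTTTTCAAA TAAACCTGGA"
print(translate(first_30))  # MEKKFSNKPG
```

The output matches the first ten residues of the protein sequence above. Passing the full gene sequence to `gc_percent` gives a value that can be compared with the listed GC content. The sketch assumes the sequence contains only A, C, G, and T; any other symbol raises a `KeyError` in `translate`.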