Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1932 |
Symbol | |
ID | 4810790 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2305451 |
End bp | 2308456 |
Gene Length | 3006 bp |
Protein Length | 1001 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 640107348 |
Product | S-layer-like domain-containing protein |
Protein accession | YP_001038343 |
Protein GI | 125974433 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type [TIGR01563] phage head-tail adaptor, putative, SPP1 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.446643 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAAAAA AGTTTTCAAA TAAACCTGGA ATCAGAATTT GTTTTGTGAT GGTATTTTCA TTTGTACTGT CTTTTGCCTT GAATATTCCC GTACATGCCG ATGAAGAGCC GGAAAAGATA TATTACGGAT TGAAGAATGC TTCCACAATT CTTAACAATA TAAATTTTAC CGATGTCAGA AACTCTGCAA CATGGTCAAA GGAGGCAATT TGCGAGGCTG CGGCTTTGGA TATTGTTAAA GGCTACGGCA ACAGGGTGTT TGGGCGCACA AACAATGTGA CAAAGGAAGA GGCAATTGCC ATCATATACA GGGCGGCAGG AAGGGAAAAG GATGCACAAT TGGCGGCGGA GGCCCTTGAG ACTGCTAGAA ATGCCGAAGA CAGAAGCAGT TATGCCCCAA GCATGTGGTC CGACGGGTAT CTGCAGCTTG CAGCCGGTGA GGGCCTGATT AGCAGCCAGG ATCTTCAGGA TGCTTTCACC GCGGATCAAA GCAGCCTTGG GACAGGAGCA TTTTTAAGAA GGGCGCCGGC CCAAAGGCAG GAAGTGGCCT ATTGGATTTC AAAAGTTTTC GGCATTGAAC CTGTCTACGG TCAGCAGAAA ATATTCAACA GTTTTCGGGA CTGGTCGAGC GCGGATCCTT TGAAAATTCC GTACATAGAA GCAGTTCTTG TGGAAAACAT AATGAACGGT GAGGGGAATG GTTATTTCAG ACCCACGGGT TTTGTAACCA GGGAACAGAT TGCGCAAATA ATAAAAAATG CGGATAAAAG AGTTCTTCCG CTTTTAGGCT ATGAAAAGAA AATGGGTACG GTGGAAGACA TCCGCAAGGA ATACGATTTT ACGGGAAATA CTCATGTATA TACCAATACT TTCCATATAA GAAGCAGCAA CGGCAAACTT CACAAGATTG TGACCGAGTT TTACGACAGC AGCAGGAGCG GATATAGAAA TGAGCAGGGC GGAAGCAGCA TAAAACCGTC CGACACCGAT CTTATTGTTT ACAGGGACGG ATGGATAGGC ACAAGCTCCC TTTTGAAAGT CGGAGACAGA GTAGAGTATA TAACGGCAGC GGACGGCACA GTCAGATATG TGCGGGTGAT ATCCGCCACG TCGGATACGG AATACATTGT TGCAAAAATA AAAGAAGTAA ATTATGACAC TTCCACCATA AAGGCGGCAA TAGGGTTTTA CCTTGAGTAC CCGGACATTG ACATGGACAG TATAAGTTTT GATTTTAGAA ACAGCGACGG GACAGTTGAC GCAACACTTG TATACAGCAA TATTGTAAAA GTGTTTATAG AGGGAAGGAT AGCAGGTATA AGGGATATTG AACCCGGAAC AGATGCAATT CTTACTGTAA AGGACAATAT TATAACTACA ATTACGACCC TTGACTTAAG ACTGAAAGAG CAGGGTGTGA TAAGTGGTAT TGTTGAGGAA AACAATCCCC ACCTGGGCTA TATAAGTTTA TACAGTGATA ACGGTTTACG GGTGGACGAT TCCCTTAACA AAATAAAAAT ATATAACTAT TCCAATCCGG AGGAAGTAAC GGTATATAAA AATCACGTCC CTGCAAAACT TGAAGATGTC GAAAGCGGTG ATTCGGTATA CATTAAACTT GACCAAAACG GAGCAATAGA AATGATAAGT GCCGTGGACA ACTATGTGGT AAAGTACGGG AAAATAATAT CTGTAAGGCC GGGGCTTATA AATGTTGCCT ATGATGACGG CACGCAGCAA ATTCTTAGTG TTGACGAAAA CGTTCCGATT GTGCAGGACA ACAAACTTAT GGATTTTGAT GCCCTGAAGG ATGGAGACAG GGTGAAACTG CTTTTAAATA TCACAGACAA GGCTACAACC CTGAAAAAGC TTACCGTAGA AGGAGACGAG CATTTTATTG CAAATATCTA CCGGGGTATA GTGGCATCGG TGGATGATAT TTCGAAAAAG CTGTTGCTTC AGCACCTTGA AGTGTTTAAC AACGGAGAAT GGGAAAGAAC GGAACAGAAA GGTTTTACCT CATTGAAAAT GTCTGAGGAT AATAAATTTG TTTACAATGA CGATATTGTG GACATGCGCC GGGCAAAGGA GCTGATTGCG GGAAGCACAG CGTATATTGC AGTTGAGAAA GACTACGGCG GAATTGAAAA GGCGGTTTTT GTATCCGTTA AAAACTCCAA AGACAGTGAG GCTCCGGTGC TTAACGACAG CATCATGTCA GCCACTTTCG GAAATAAAGG GGAATTCACG CTGAAAAAAG AATATCTGAA CATTTCTTAC GGAAACGGGA CAATCATTGT AAAGGACAAC AGGCTGGTTA CCGGAAACAG CATAAATTCG GAGGACAGGG CATATGTTGT TGCAAGCAGA AGCTATGACG ACGGAAAATA TTATGCTTTT ATAGTCAAGA TAGATGACAA GACGGATTCG AACTTCTTTA CCATATATCG CGGAAGGATA GCAAACATAA ATGAGAACAA AGACTTTACT TTGGAATCTT ATTCAGAGCT TAAAGGAGTT GAATGGAACT ACTACAACAC CCCCAGAACC TTTAGAATCA CATATGACAC GCAAATTCTA GGTGATGACG GAGTTGTAGG ACAGAGGGAC TTTACCGACT ATGGTGATTC AAGCTATAAG AGCAGGACGG TATATGTATT GTCACACGGT GCCGATGCTG TGTTGATTAG TACTGCTCCT TACGGTAACA TTAACGTAAA AGGTAAAATT TATGAACTTA TATCAGAAAA TGCCGATGGT TCAGAGGCTC AGGAAGGACA GCAAGAACCG GTGGGATTTA AACTCCAGAA CTCAAAAGTT TACGATTTGC AATCGCATAT GTGGGTGGAC GGTAAAGACA TGGATATTAA CCTTCTTAAA AACAGCATTA TACTTAAGGA CAACAAAATT ATAAAACCGT CCGAATTAAA GAAAGGCGAC AGTGTAAGGC TTATCAAAAA AGATGACGAA CAGGCTGGAG ATGCCTATAT TATATTCGTT GAATAA
|
Protein sequence | MEKKFSNKPG IRICFVMVFS FVLSFALNIP VHADEEPEKI YYGLKNASTI LNNINFTDVR NSATWSKEAI CEAAALDIVK GYGNRVFGRT NNVTKEEAIA IIYRAAGREK DAQLAAEALE TARNAEDRSS YAPSMWSDGY LQLAAGEGLI SSQDLQDAFT ADQSSLGTGA FLRRAPAQRQ EVAYWISKVF GIEPVYGQQK IFNSFRDWSS ADPLKIPYIE AVLVENIMNG EGNGYFRPTG FVTREQIAQI IKNADKRVLP LLGYEKKMGT VEDIRKEYDF TGNTHVYTNT FHIRSSNGKL HKIVTEFYDS SRSGYRNEQG GSSIKPSDTD LIVYRDGWIG TSSLLKVGDR VEYITAADGT VRYVRVISAT SDTEYIVAKI KEVNYDTSTI KAAIGFYLEY PDIDMDSISF DFRNSDGTVD ATLVYSNIVK VFIEGRIAGI RDIEPGTDAI LTVKDNIITT ITTLDLRLKE QGVISGIVEE NNPHLGYISL YSDNGLRVDD SLNKIKIYNY SNPEEVTVYK NHVPAKLEDV ESGDSVYIKL DQNGAIEMIS AVDNYVVKYG KIISVRPGLI NVAYDDGTQQ ILSVDENVPI VQDNKLMDFD ALKDGDRVKL LLNITDKATT LKKLTVEGDE HFIANIYRGI VASVDDISKK LLLQHLEVFN NGEWERTEQK GFTSLKMSED NKFVYNDDIV DMRRAKELIA GSTAYIAVEK DYGGIEKAVF VSVKNSKDSE APVLNDSIMS ATFGNKGEFT LKKEYLNISY GNGTIIVKDN RLVTGNSINS EDRAYVVASR SYDDGKYYAF IVKIDDKTDS NFFTIYRGRI ANINENKDFT LESYSELKGV EWNYYNTPRT FRITYDTQIL GDDGVVGQRD FTDYGDSSYK SRTVYVLSHG ADAVLISTAP YGNINVKGKI YELISENADG SEAQEGQQEP VGFKLQNSKV YDLQSHMWVD GKDMDINLLK NSIILKDNKI IKPSELKKGD SVRLIKKDDE QAGDAYIIFV E
|
| |