Gene Information |
Locus tag | Cthe_2789 |
Symbol | |
ID | 4810106 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 3289547 |
End bp | 3290629 |
Gene Length | 1083 bp |
Protein Length | 360 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 640108209 |
Product | NLPA lipoprotein |
Protein accession | YP_001039181 |
Protein GI | 125975271 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components |
TIGRFAM ID | |
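The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes that Start bp and End bp are 1-based inclusive positions and that Gene Length includes the terminal stop codon. Both assumptions fit the numbers in this record but are not stated by it. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive start/end coordinates (assumed 1-based)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Number of amino acids, assuming the gene length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values copied from the record for Cthe_2789
start_bp, end_bp = 3289547, 3290629
length_bp = gene_length(start_bp, end_bp)
print(length_bp, expected_protein_length(length_bp))
```

Under these assumptions the computed values match the Gene Length (1083 bp) and Protein Length (360 aa) fields.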
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAG GATTGAGAAT GAAAAAGAAA ATTCTTACGG GAATTATATC ATTTCTGATT ACAAGCATAT TGCTAAGCGG ATGCGGAGGT CAAAAAGTTC AAAAGACCAA TTTGGAAGAC TCTGGTACCG GGCGGGAATT GAAGAAATTT AAAGTGGGTT ATCTGGCGTC GCCGGGGCAC GTGCTTTACT TTGTAGCGAA AGAAAAAGGA TTTTTTGAAG AAGAAGGGCT GGACGTGGAA CTGTTTTTGT TTACAAACTC CGGGGAAGGC TTAAATGCAA TCAGTTCCGG CAAAGTGGAC GTTGGTTCCT TTGGTACGGC GGCACCACTT ACTTTCATTT CGAAAGGGAC TGATTTTGTC ATCTTTGGCG GGCAGCAAAC CGAGGGACAC GGAATTGTGG CTCTTCCTCA GAAGGCAAAG GAGCTTACGG ATTTAAACAG CTTTAAAGGC AAAACGATTG CCACTGTAAG GCTTGCCACG GGAGACGTTA TTTTCAGGGC TGCATTGTCG GAAGCAGGCA TTGACTGGAA AAATGAACTG ACAATTAATG AGTTGGATTC ACCGGCAGCG GTACTTGAGG CTGTGAAGAA AGGAAGTGTT GATGCAGGTA TTGTGTGGAC TCCTTTTATT AAACTTGGAG AAAAGCAGGG ACTTGAAATA GTAAAATACT CCGGAGAACT GGTTAAAATG CATACTTGCT GCCGTCAGGT TGCCCTCTCA TCTAATGTAA AGGAAAACAA AGAAGACTTT GTAAGGTTTA TGGCAGCTTT GATAAAGGCA TACAAATTCT ATATGGAAAA TCAGGATGAA ACAGTGGATA TTATTGCGAA ATATGCAAAA GTTGACAAAG ACATTATAAA GGAAGAAACA TACGGAGGCC ATATATACAG CATACCTGAC CCTGACAAAG AGGGAGTTAT AAGGTTCTGG GACCTTATAA ACAAGTCCGG ATATATCAGT TCTGATCTCA ATATTGAAAA TTTTATAGAC ACATCAATCT ATAAGCTTGC TCTTGAAGAT GTTCTTAAGG AGTATCCAAA TGATGAGGTT TATAAGAAGC TAAAAGCAGA CTTTAAGGAG TAA
|
Protein sequence | MKKGLRMKKK ILTGIISFLI TSILLSGCGG QKVQKTNLED SGTGRELKKF KVGYLASPGH VLYFVAKEKG FFEEEGLDVE LFLFTNSGEG LNAISSGKVD VGSFGTAAPL TFISKGTDFV IFGGQQTEGH GIVALPQKAK ELTDLNSFKG KTIATVRLAT GDVIFRAALS EAGIDWKNEL TINELDSPAA VLEAVKKGSV DAGIVWTPFI KLGEKQGLEI VKYSGELVKM HTCCRQVALS SNVKENKEDF VRFMAALIKA YKFYMENQDE TVDIIAKYAK VDKDIIKEET YGGHIYSIPD PDKEGVIRFW DLINKSGYIS SDLNIENFID TSIYKLALED VLKEYPNDEV YKKLKADFKE
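The relationship between the gene sequence, the translation table, and the protein sequence can be sketched as follows. This is a minimal illustration, not the database's own pipeline. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in permitted start codons. The helper names `translate` and `gc_percent` are hypothetical, and only the first 30 bases of the gene are used to keep the example short.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G). Table 11 uses the same
# codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(dna: str) -> str:
    """Remove the spaces used to group bases in blocks of ten."""
    return "".join(dna.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(dna)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(dna)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First three blocks of the gene sequence from the record
fragment = "ATGAAAAAAG GATTGAGAAT GAAAAAGAAA"
print(translate(fragment))  # MKKGLRMKKK
print(round(gc_percent(fragment), 1))
```

The translated fragment matches the first ten residues of the protein sequence. Passing the full gene sequence to `gc_percent` gives a value that can be compared with the GC content field; the fragment on its own is more AT-rich than the gene as a whole.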