Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1571 |
Symbol | |
ID | 4810078 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 1899604 |
End bp | 1900671 |
Gene Length | 1068 bp |
Protein Length | 355 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 640106989 |
Product | binding-protein-dependent transport systems inner membrane component |
Protein accession | YP_001037990 |
Protein GI | 125974080 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0600] ABC-type nitrate/sulfonate/bicarbonate transport system, permease component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTACGATT TAGTAAAACT ACAAAGCCCA ACATCCTCGG AATTAAAAGT AAAGCAACAA TATGGAAAAC TAAAAACGGT TTCGGCCCAA AATTGGAGTA CTAATTTTGT ACTGACACTG GCAGGGTTTG CTGTAGCCAT AGCAGCCAAT ATTCTGGTGA AGCAAGTGGC AAATGTGGAA ATCAAAACTT ATTTTGTTGC AGGAATTCCC TTCACATTTA ATTTGCTTTA CCGATTGGTC CTGGTAGTAA TTATTTTGAT TTATGCCGGT ATGGGAATAT TTTCATATTT TGATCCTGTG CGAAGAGAAA AATTTTCCAG GCAGGCTCCC TTTCGTTTCG CTATGGGTCT GGCTATTACA GCTTGGGATA TATTAGGAAC AAAGCTTCTT TTATTACCGC AGCCGTTTTT CCCGGGACCG GCAAAAATTC TTGAATCCTT TTTGATTGAA GGAGATTTTA TTTTACAAAA TACCCTTTAC TCCATTAAAC TTTTTTTGGC CGGATTTACT CTGGGTGTGG TGTTTGGGGT GGGTACGGGA ATTTTAATCG GTTGGTTTCC AAAGGTATAT TATTGGGTAT ATCCAATATT AAAGATTACC GGTGTCATTC CTGCCGTAGC GTGGATGCCC TTTGCACTGA CGCTTTTCCC GTCTCCGTTT TCAGCAGCTG TTTTTCTTAT TGTAATATGC GCATGGTTTT CCATTGCGGC TCTTACAGCC CAGGGCATAC AATCCACCCC AAAGTATCAG TTCGAAGTAG CTCGCACCCT GGGGGCAAAA ACGCCATTTT TAATATTCCA TGTTGCAGTG CCTCAGGCCA TGCCCCAGAT TTTTACCGGT ATTTCCAATG CCAATGGTTT TGCTTTTACA ACTTTGGTAA TGGCAGAGAT GATGGGCCAG CCGGGAGGAT TGGGTTACTA CATAAATTTA AGCAAGGTGT GGTCTGCCTA CTATAAGGTA TTTGCCTCCA TTCTCGTTAT GGCCCTGCTG TTCAGTCTAA TTATGAAAAT ACTGGGGCTA ATACAGGCGA ATGTACTCCG GTGGCAGAAA GGATGGGTTA AGGAATGA
|
Protein sequence | MYDLVKLQSP TSSELKVKQQ YGKLKTVSAQ NWSTNFVLTL AGFAVAIAAN ILVKQVANVE IKTYFVAGIP FTFNLLYRLV LVVIILIYAG MGIFSYFDPV RREKFSRQAP FRFAMGLAIT AWDILGTKLL LLPQPFFPGP AKILESFLIE GDFILQNTLY SIKLFLAGFT LGVVFGVGTG ILIGWFPKVY YWVYPILKIT GVIPAVAWMP FALTLFPSPF SAAVFLIVIC AWFSIAALTA QGIQSTPKYQ FEVARTLGAK TPFLIFHVAV PQAMPQIFTG ISNANGFAFT TLVMAEMMGQ PGGLGYYINL SKVWSAYYKV FASILVMALL FSLIMKILGL IQANVLRWQK GWVKE
|
| |