Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2575 |
Symbol | |
ID | 4809182 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 3047311 |
End bp | 3048321 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 640107989 |
Product | hypothetical protein |
Protein accession | YP_001038968 |
Protein GI | 125975058 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.225151 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAA CCATGAAAAA AGCTGCTTTG ATTTTGATTG CGGCGGCGAT GTTAATGATG TCCTTTGCGG GCTGCGCCGA AAGGGATTAT ACTACTGTGA GGCTGAATGA AGTTACCCGT TCGGTGTTTT ATGCTCCCCA GTATGTTGCG TTAAACTTGG GTTTCTTTGA GGAAGAAGGA CTTAAAATTG ACATTGCAAG CGGTCAGGGT GCTGACAAGG TAATGACAGC GGTTTTGTCC GGGCAGGCGG ATATAGGTTT TTCCGGACCA GAAGCTGCCA TATATGTGTA CAATGAAGGC AGGGAAGACT ATGCCGTTGT GTTTGCACAG CTTACAAAAC GTGACGGTTC GTTTTTGGTG GGAAGAAAAC CGGAACCGGA TTTTAAATGG GAAAACCTCA AAGGAAAGAC GATAATAGGC GGAAGAAAAG GCGGAGTTCC CGAAATGACT CTTGAGTATG TGCTGAAGAA AAACAACCTG ATACCGGGAG TTGATGTATA TATTGATACA AGTGTTCAGT TTGCTTTAAT GGCAGGGGCG TTTACGGGAG GACAGGGTGA CTATGTTACG CTGTTTGAAC CGGTGGCTTC CACAGTGGAA AAGGAAGGCA AGGGATATAT AATTACATCC ATAGGTAAAG AAAGCGGTGA GATTCCTTAT ACAGCATATT ATGCAAGCAA GAGCTACATA GAAAAAAACA AAGACATAAT CCAGAAATTT ACCAACGCCA TTTACAAAGG CCAGAAGTGG GTTGAAACTC ACACACCGGA GGAAATAGCG GATGTTATAA AACCTTCTTT CCCCGATTCG GATAAGGAGA CACTGATAAC GGTGGCAAAG AGATACAAGG AAACGGATGT ATGGAACAAA GACCCTATAT TGAAAAAAGA ATCCCTTGAC CTTCTTCAGG AAGTAATGAG CATGGCAGGG GAACTTAAAA AAGAAGCGCC TTATGAAAAA ATAGTAACAA AAGAGTTTGC CGAAAAAGCT ATGGAGAACT TTGAGAACTA A
|
Protein sequence | MKKTMKKAAL ILIAAAMLMM SFAGCAERDY TTVRLNEVTR SVFYAPQYVA LNLGFFEEEG LKIDIASGQG ADKVMTAVLS GQADIGFSGP EAAIYVYNEG REDYAVVFAQ LTKRDGSFLV GRKPEPDFKW ENLKGKTIIG GRKGGVPEMT LEYVLKKNNL IPGVDVYIDT SVQFALMAGA FTGGQGDYVT LFEPVASTVE KEGKGYIITS IGKESGEIPY TAYYASKSYI EKNKDIIQKF TNAIYKGQKW VETHTPEEIA DVIKPSFPDS DKETLITVAK RYKETDVWNK DPILKKESLD LLQEVMSMAG ELKKEAPYEK IVTKEFAEKA MENFEN
|
| |