Gene Information |
Locus tag | Cthe_2537 |
Symbol | |
ID | 4809293 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 3006426 |
End bp | 3008225 |
Gene Length | 1800 bp |
Protein Length | 599 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 640107953 |
Product | adenylylsulfate kinase / sulfate adenylyltransferase subunit 1 |
Protein accession | YP_001038932 |
Protein GI | 125975022 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0529] Adenylylsulfate kinase and related kinases; [COG2895] GTPases - Sulfate adenylate transferase subunit 1 |
TIGRFAM ID | [TIGR00231] small GTP-binding protein domain; [TIGR00485] translation elongation factor TU; [TIGR02034] sulfate adenylyltransferase, large subunit |
|
|
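The coordinate and length fields in the table above can be cross-checked with a little arithmetic. The sketch below is illustrative only and not part of the original record. It assumes the start and end positions are inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are made up for this example.

```python
# Values copied from the Gene Information table.
start_bp = 3006426
end_bp = 3008225

# Assumption: both coordinates are inclusive, so one is added to the difference.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and does not encode an amino acid.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")     # 1800 bp
print(protein_length, "aa")  # 599 aa
```

Both results agree with the Gene Length and Protein Length fields listed in the record.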
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAGCAA GAGAACAAAT GAATATTGTA ATCGTCGGTC ATGTGGATCA TGGAAAAAGC ACCGTCATAG GTAGACTGCT TGCGGATACC GGCTCTCTTC CGGAGGGAAA GCTTGAGTCT GTCAAAGAGT TTTGCAGAAA GAATGCCAGG CCTTTTGAGT ACGCGTTTTT GCTGGACGCA TTAAAGGATG AACAGGCGCA GGGCATTACC ATAGATACTG CAAGATGTTT TTTCAAGACA AACAAAAGGG ACTACATTAT TATCGACGCA CCGGGGCATG TTGAGTTCTT AAAGAACATG GTTACGGGAG CGTCCCGGGC GGAAGCCGCC CTTTTGGTAA TAGACGCGAA GGAAGGTATA AAGGAAAATT CCAAACGCCA CGGACATATT GTTTCCATGC TGGGAATCAA ACAAGTGGTT GTTTTGGTGA ACAAAATGGA TTTGGTGGGC TTTGACAGGG AAGTTTATGA AGCTATTGTC TCAGAGTTTG GCGAGTTTTT GCAAAAGGTT AACATAAGAC CAATTAATTA TATTCCAATA AGTGCCTTCA ACGGAGACAA TATTGCCCAA AGGTCCCGGA ACACTTTGTG GTATGACGGG CCCACGGTTT TGGAACAGTT GGATGGGTTT GTGAATAAAA AAGAAAATCG TCAGCTTCCG TTCCGCATGC CTGTACAGGA TATTTACAAA TTTACCGAAG AGGGCGATGA CCGAAGGATT GTGGCAGGTA CAATCATAAG CGGCTCAATC AGTGTGGGGG ACGAGGTTGT ATTTCTTCCT TCAAACAAGA AGTCGGTAAT AAAAAGTATA GAGGGATTTA ATGTAAAACC CAGAAATACG GCCTATGCAG ACGAGGCAAT AGGAGTAACG CTGACCACAC AAATTTATAT AAAGCCCGGA GAACTGATGG TGAAGGCAAA TGAAAAACAT CCGTCAGTGA GCTCCCGCTT TAGGGCGAAC ATATTCTGGG TTGGCAAGGC TCCTTTGATA AAGAACAAAA ACTATAAGTT GAAAATCGGT ACGATGAAAA TTGGCGTCAA ACTCATTGAA ATATCCCATA TCATTGATGC GGCGGAGCTC AACATTGACA CTTTCAAAGA CCAGGTTGAA AGACATGATG TGGCAGAGTG CATTTTTGAA ACCGCAAAAC CTATTGCATA TGATGTTATT TCCGAAATCG AGCAGACCGG AAGGTTTGTA ATTGTGGACA ACTATGAGAT ATCCGGCGGA GGAATTATTT TGGAAGCAGT TCCGGATACC GACAGCAGCT TGCTGACCCA CATCAGGGAA AGAGAATTTT TGTGGGAGAA AAGTTTGATT TCTGCAAAGC AAAGGGAAAA TGCTTATGGA CACAAAGCGA AGTTTATCGT AATTACTTCG GGAAGCGAAG GAAAAGAAAA GGATATCCAG GATATCGGAA GACAATTGGA AGAGCGGCTT TTCAACATGA AGTACAAAGC GTATTATCTC GGTGTTTCAA GCATACTGCA CGGGCTTGCG TCGGATGTGG CAAACAGCTA TGAGGACAGA GACGAGCATA TAAGGCAGAT TGGAGAACTG GCAAGGATAT TTACCGATTC GGGCCAAATA TTTATCACCA GCATATTCAA TCTGGATGAC TATGAGGCCA AAAAGCTTAA ACTTTTAAAC CAGCCCAATG AAATCATAGT GGTGAACATA GGACAGACGC CTTTCAACAA TTTTGTGCCC GATGCAAACA TAGAAGATAC GGAGGGCGCG GTTGAGGCTG TGTGTGAGTT GTTGAAACGT CAGGAAATTA TACTTGAATA TTATATATGA
|
Protein sequence | MEAREQMNIV IVGHVDHGKS TVIGRLLADT GSLPEGKLES VKEFCRKNAR PFEYAFLLDA LKDEQAQGIT IDTARCFFKT NKRDYIIIDA PGHVEFLKNM VTGASRAEAA LLVIDAKEGI KENSKRHGHI VSMLGIKQVV VLVNKMDLVG FDREVYEAIV SEFGEFLQKV NIRPINYIPI SAFNGDNIAQ RSRNTLWYDG PTVLEQLDGF VNKKENRQLP FRMPVQDIYK FTEEGDDRRI VAGTIISGSI SVGDEVVFLP SNKKSVIKSI EGFNVKPRNT AYADEAIGVT LTTQIYIKPG ELMVKANEKH PSVSSRFRAN IFWVGKAPLI KNKNYKLKIG TMKIGVKLIE ISHIIDAAEL NIDTFKDQVE RHDVAECIFE TAKPIAYDVI SEIEQTGRFV IVDNYEISGG GIILEAVPDT DSSLLTHIRE REFLWEKSLI SAKQRENAYG HKAKFIVITS GSEGKEKDIQ DIGRQLEERL FNMKYKAYYL GVSSILHGLA SDVANSYEDR DEHIRQIGEL ARIFTDSGQI FITSIFNLDD YEAKKLKLLN QPNEIIVVNI GQTPFNNFVP DANIEDTEGA VEAVCELLKR QEIILEYYI
|
| |
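The gene sequence above is printed in space-separated blocks of ten bases, so it has to be joined before any analysis. The sketch below is a minimal illustration rather than part of the record: the helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first three blocks and the last block of the gene sequence are used to keep the example short.

```python
def clean_sequence(blocks: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(blocks.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# Excerpts copied from the Gene sequence field (first three blocks, last block).
first_blocks = "ATGGAAGCAA GAGAACAAAT GAATATTGTA"
last_block = "TTATATATGA"

excerpt = clean_sequence(first_blocks)
print(len(excerpt))                               # 30 bases in the excerpt
print(round(gc_fraction(excerpt), 2))             # GC fraction of the excerpt only
print(excerpt.startswith("ATG"))                  # gene begins with a start codon
print(clean_sequence(last_block).endswith("TGA")) # gene ends with a stop codon
```

Passing the full Gene sequence field to `clean_sequence` and then to `gc_fraction` would give the GC fraction of the whole gene, which can be compared with the GC content value listed in the record. The 30-base excerpt is not expected to match that figure.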