Gene Cthe_2537 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2537 
Symbol 
ID4809293 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp3006426 
End bp3008225 
Gene Length1800 bp 
Protein Length599 aa 
Translation table11 
GC content43% 
IMG OID640107953 
Productadenylylsulfate kinase / sulfate adenylyltransferase subunit 1 
Protein accessionYP_001038932 
Protein GI125975022 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0529] Adenylylsulfate kinase and related kinases
[COG2895] GTPases - Sulfate adenylate transferase subunit 1 
TIGRFAM ID[TIGR00231] small GTP-binding protein domain
[TIGR00485] translation elongation factor TU
[TIGR02034] sulfate adenylyltransferase, large subunit 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAGCAA GAGAACAAAT GAATATTGTA ATCGTCGGTC ATGTGGATCA TGGAAAAAGC 
ACCGTCATAG GTAGACTGCT TGCGGATACC GGCTCTCTTC CGGAGGGAAA GCTTGAGTCT
GTCAAAGAGT TTTGCAGAAA GAATGCCAGG CCTTTTGAGT ACGCGTTTTT GCTGGACGCA
TTAAAGGATG AACAGGCGCA GGGCATTACC ATAGATACTG CAAGATGTTT TTTCAAGACA
AACAAAAGGG ACTACATTAT TATCGACGCA CCGGGGCATG TTGAGTTCTT AAAGAACATG
GTTACGGGAG CGTCCCGGGC GGAAGCCGCC CTTTTGGTAA TAGACGCGAA GGAAGGTATA
AAGGAAAATT CCAAACGCCA CGGACATATT GTTTCCATGC TGGGAATCAA ACAAGTGGTT
GTTTTGGTGA ACAAAATGGA TTTGGTGGGC TTTGACAGGG AAGTTTATGA AGCTATTGTC
TCAGAGTTTG GCGAGTTTTT GCAAAAGGTT AACATAAGAC CAATTAATTA TATTCCAATA
AGTGCCTTCA ACGGAGACAA TATTGCCCAA AGGTCCCGGA ACACTTTGTG GTATGACGGG
CCCACGGTTT TGGAACAGTT GGATGGGTTT GTGAATAAAA AAGAAAATCG TCAGCTTCCG
TTCCGCATGC CTGTACAGGA TATTTACAAA TTTACCGAAG AGGGCGATGA CCGAAGGATT
GTGGCAGGTA CAATCATAAG CGGCTCAATC AGTGTGGGGG ACGAGGTTGT ATTTCTTCCT
TCAAACAAGA AGTCGGTAAT AAAAAGTATA GAGGGATTTA ATGTAAAACC CAGAAATACG
GCCTATGCAG ACGAGGCAAT AGGAGTAACG CTGACCACAC AAATTTATAT AAAGCCCGGA
GAACTGATGG TGAAGGCAAA TGAAAAACAT CCGTCAGTGA GCTCCCGCTT TAGGGCGAAC
ATATTCTGGG TTGGCAAGGC TCCTTTGATA AAGAACAAAA ACTATAAGTT GAAAATCGGT
ACGATGAAAA TTGGCGTCAA ACTCATTGAA ATATCCCATA TCATTGATGC GGCGGAGCTC
AACATTGACA CTTTCAAAGA CCAGGTTGAA AGACATGATG TGGCAGAGTG CATTTTTGAA
ACCGCAAAAC CTATTGCATA TGATGTTATT TCCGAAATCG AGCAGACCGG AAGGTTTGTA
ATTGTGGACA ACTATGAGAT ATCCGGCGGA GGAATTATTT TGGAAGCAGT TCCGGATACC
GACAGCAGCT TGCTGACCCA CATCAGGGAA AGAGAATTTT TGTGGGAGAA AAGTTTGATT
TCTGCAAAGC AAAGGGAAAA TGCTTATGGA CACAAAGCGA AGTTTATCGT AATTACTTCG
GGAAGCGAAG GAAAAGAAAA GGATATCCAG GATATCGGAA GACAATTGGA AGAGCGGCTT
TTCAACATGA AGTACAAAGC GTATTATCTC GGTGTTTCAA GCATACTGCA CGGGCTTGCG
TCGGATGTGG CAAACAGCTA TGAGGACAGA GACGAGCATA TAAGGCAGAT TGGAGAACTG
GCAAGGATAT TTACCGATTC GGGCCAAATA TTTATCACCA GCATATTCAA TCTGGATGAC
TATGAGGCCA AAAAGCTTAA ACTTTTAAAC CAGCCCAATG AAATCATAGT GGTGAACATA
GGACAGACGC CTTTCAACAA TTTTGTGCCC GATGCAAACA TAGAAGATAC GGAGGGCGCG
GTTGAGGCTG TGTGTGAGTT GTTGAAACGT CAGGAAATTA TACTTGAATA TTATATATGA
 
Protein sequence
MEAREQMNIV IVGHVDHGKS TVIGRLLADT GSLPEGKLES VKEFCRKNAR PFEYAFLLDA 
LKDEQAQGIT IDTARCFFKT NKRDYIIIDA PGHVEFLKNM VTGASRAEAA LLVIDAKEGI
KENSKRHGHI VSMLGIKQVV VLVNKMDLVG FDREVYEAIV SEFGEFLQKV NIRPINYIPI
SAFNGDNIAQ RSRNTLWYDG PTVLEQLDGF VNKKENRQLP FRMPVQDIYK FTEEGDDRRI
VAGTIISGSI SVGDEVVFLP SNKKSVIKSI EGFNVKPRNT AYADEAIGVT LTTQIYIKPG
ELMVKANEKH PSVSSRFRAN IFWVGKAPLI KNKNYKLKIG TMKIGVKLIE ISHIIDAAEL
NIDTFKDQVE RHDVAECIFE TAKPIAYDVI SEIEQTGRFV IVDNYEISGG GIILEAVPDT
DSSLLTHIRE REFLWEKSLI SAKQRENAYG HKAKFIVITS GSEGKEKDIQ DIGRQLEERL
FNMKYKAYYL GVSSILHGLA SDVANSYEDR DEHIRQIGEL ARIFTDSGQI FITSIFNLDD
YEAKKLKLLN QPNEIIVVNI GQTPFNNFVP DANIEDTEGA VEAVCELLKR QEIILEYYI