Gene Msil_1780 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_1780 
Symbol 
ID7090897 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp1940864 
End bp1942807 
Gene Length1944 bp 
Protein Length647 aa 
Translation table11 
GC content63% 
IMG OID643465108 
Productsulfate adenylyltransferase, large subunit 
Protein accessionYP_002362088 
Protein GI217977941 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0529] Adenylylsulfate kinase and related kinases
[COG2895] GTPases - Sulfate adenylate transferase subunit 1 
TIGRFAM ID[TIGR00231] small GTP-binding protein domain
[TIGR00455] adenylylsulfate kinase (apsK)
[TIGR02034] sulfate adenylyltransferase, large subunit 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones64 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGGAAT CGGTCGCCGC CGCGTCGTCG GGCGGACTGT CGGCGCAAGC GGAGCCGTCT 
GACGCCGCGC CGCCGCCAAC CTTGCGGCTG ATCACTTGCG GCTCCGTCGA TGACGGCAAG
TCGACGCTGA TCGGACGCCT GCTGTTTGAA CAGAGCCTCA TCTTCGACGA TCAGTTCGCC
GCTCTCGAGC GGGACTCGAA GCGGCACGGC ACGACCGGCG ACGACATCGA CTTCGCCCTT
CTCGTCGACG GCCTCGAGGC CGAACGCGAG CAGGGCATCA CCATCGACGT CGCCTATCGC
TATGTCTCGA CGCCGCGCCG CGCCTTCATC ATCGCCGACA CCCCCGGCCA TGAGCAATAT
ACGCGCAACA TGGCGACCGC CGCCTCAAGC GCCAGCCTCG CCATTCTGCT CGTCGACGCC
CGCAAGGGGC TGCTCACCCA GACCTGCCGT CACGCCGCCA TCGTCTCGCT AATCGGCATC
AAACATGTCG TGCTGGCCGT GAACAAGATC GACCTCACCG GCTTCGACGA GGCGGGGTTC
CACAAGATCG TCGCCGACTT CAAGGCTTTC GCCGCGACGC TCGGCTTCGC CTCGATCACG
CCGATCCCGA TCTCCGCCCG CCATGGCGAC AATGTCTCGG AAAAAAGCTC CAACACGCCT
TGGCACAAGG GGCCGGCGCT GCTCTCCTTC CTCGAGACAG TCGACGTCGA GGAAGACCAG
AGCGCAAAAC CGTTCCGCTT TCCCGTGCAA TGGGTGAACC GGCCCCATCT CGACTTTCGC
GGCTTCGCCG GGACGATCGC CAGCGGTTCG GTGCGGCCGG GCGATGAAAT CGTCGTCGCC
GGTTCCGGCA AGATCTCGAA GGTCGCGCGC ATCGTCGCCG CCGACGGCGA TCTTGCCAGC
GCCGGCGCGG GCGAGGCCGT CACCCTGACG CTTGAAGACG AACTCGACAT CGCGCGCGGC
GATCTTCTCT CCGACGCCAA AAGCCGGCCG GAGGTGTCCG ATCAATTCGC CGCCCATCTC
ATCTGGATGA GCGAAGACAA GCTCATGCCC GGCCGCTCCT ATCTCTTGAA GAGCGGCGCG
AAAACCGTTC CCGTCACGGT GACGGAGCTG AAGCACCGCA TCGACGTCAA CACGCTCGGC
GAGCTCCCGG CGCGCACCCT GAGCCTTAAC GAAGTCGGCG TCTGCAATCT GGCGACAGCG
ACGCCGATCG CCTTCGACCC TTACACAGAC AATCACACCA CCGGTTCGTT CATTCTGATC
GACCGCGCGA CCAACGCCAC CGCGGCCGCC GGCCTGATCT CTTTTGGCCT GCGCCGCGCC
ACCAATATTC ATCGGCATGG GCTCAGCATC GGCAAGCTCG ACCGCGCGCG GCTGAACAAT
CAGAAGCCGG CGGTGTTGTG GTTCACGGGG CTTTCGGGCT CGGGCAAATC CACCATCTCC
AATCTCGTCG AGTCCTGGCT TTATGCGCAT GGCATGCGCA CCATCCTGCT CGACGGCGAC
AATATCCGCC ACGGGCTCAA CAAGAATCTC GGCTTCACTG AGGTCGACCG CGTCGAAAAT
ATCCGCCGCG TCGGGGAGGT CGCCAAGTTG ATGACGGATG CGGGACTTAT CGTCCTTTGC
TCCTTCATCT CGCCGTTCAA CGCCGAACGC CAACTTGTGC GCGATTTGCT GGACGACGGC
GAATTCTTGG AGATTTTCGT CGACACACCG ATCGAGGATT GCATCGCGCG CGACCCGAAG
GGCCTCTACA AAAAGGCGCT CGCCGGCGAG ATCAAGAATT TCACCGGCGT CGATCAGCGC
TATGAGGCGC CGCAAAATCC CGAAATGATC GTCGCCCGTG ACGGCCAGAC GCCGCAACAG
GCCGCGGCCG CCATCGTCAA GGAGCTAATC CGGCGCGGCT TCATCGACAG CTTCGCTGAT
CCGGGAGACG ATTTCTCGAT CTGA
 
Protein sequence
MLESVAAASS GGLSAQAEPS DAAPPPTLRL ITCGSVDDGK STLIGRLLFE QSLIFDDQFA 
ALERDSKRHG TTGDDIDFAL LVDGLEAERE QGITIDVAYR YVSTPRRAFI IADTPGHEQY
TRNMATAASS ASLAILLVDA RKGLLTQTCR HAAIVSLIGI KHVVLAVNKI DLTGFDEAGF
HKIVADFKAF AATLGFASIT PIPISARHGD NVSEKSSNTP WHKGPALLSF LETVDVEEDQ
SAKPFRFPVQ WVNRPHLDFR GFAGTIASGS VRPGDEIVVA GSGKISKVAR IVAADGDLAS
AGAGEAVTLT LEDELDIARG DLLSDAKSRP EVSDQFAAHL IWMSEDKLMP GRSYLLKSGA
KTVPVTVTEL KHRIDVNTLG ELPARTLSLN EVGVCNLATA TPIAFDPYTD NHTTGSFILI
DRATNATAAA GLISFGLRRA TNIHRHGLSI GKLDRARLNN QKPAVLWFTG LSGSGKSTIS
NLVESWLYAH GMRTILLDGD NIRHGLNKNL GFTEVDRVEN IRRVGEVAKL MTDAGLIVLC
SFISPFNAER QLVRDLLDDG EFLEIFVDTP IEDCIARDPK GLYKKALAGE IKNFTGVDQR
YEAPQNPEMI VARDGQTPQQ AAAAIVKELI RRGFIDSFAD PGDDFSI