Gene Mlg_2345 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_2345 
Symbol 
ID4269101 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp2657116 
End bp2659032 
Gene Length1917 bp 
Protein Length638 aa 
Translation table11 
GC content67% 
IMG OID638127103 
Productsulfate adenylyltransferase subunit 1 / adenylylsulfate kinase 
Protein accessionYP_743175 
Protein GI114321492 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0529] Adenylylsulfate kinase and related kinases
[COG2895] GTPases - Sulfate adenylate transferase subunit 1 
TIGRFAM ID[TIGR00231] small GTP-binding protein domain
[TIGR00455] adenylylsulfate kinase (apsK)
[TIGR02034] sulfate adenylyltransferase, large subunit 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones43 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGCACG CATCCGAGCT CATCGCCCAG GACATTGAGG CCTACCTGAA GCAGCACGAG 
CAGAAGGACC TGCTGCGCTT TATCACCTGC GGCAGCGTGG ACGACGGCAA GAGCACGCTG
ATCGGCCGGC TGCTGCACGA CTCGCAGCTC ATCTACGAGG ACCAGCTCGC CGCCATCAAG
CGCGACTCGG GCAAGTACGG CACCACCGGC GATGAGGTGG ACCTGGCGCT GCTGGTGGAC
GGCCTGCAGT CGGAGCGCGA ACAGGGCATC ACCATCGATG TGGCCTACCG CTACTTCTCC
ACCGAGAAGC GCAAGTTCAT CATTGCCGAC ACCCCGGGCC ACGAGCAGTA CACCCGTAAC
ATGGCCACCG GGGCCTCCAC CGCCAGTCTG GCGATCATCC TGGTGGATGC GCGCAAGGGG
CTGCAGGTGC AGACCCGGCG CCACAGCTTT ATCGTCTCGC TGCTGGGCAT CCGCCACGTG
GTGCTGGCGG TGAACAAGAT GGACCTGGTG GACTACGACC AGGCGGTGTT CGACGCCATC
TGCAAAGATT ACCAGGACTT CGTGAAGCGC CTGGGCATTC CCGATGTGCA CTACGTGCCG
GTCTCCGCCT TGCAGGGCGA TAACGTGGTG AAGCCCTCGA CGGCCATGCC CTGGTACGAC
GGCCCGACGA TGATGCACAT CCTCGAGACC CTGCCGGTGG CGGCCGATCG CAATTTCGAC
GAGTTCCGCC TGCCGGTGCA GTACGTCAAC CGGCCGAACC CGGACTTTCG CGGCTTCTGC
GGCACCATCG CCAGCGGTGT GGTGCATCCG GGCGACCCGG TGGTCGCCCT GCCCTCGGGC
AAGCGCTCGC GGGTGGCGCG CATTGTCACC TGGGAGGGGG ACCTGGCGGA GGCCTTCCCG
CCCCAGGCGG TGACGCTGAC CCTGGAGGAC GAGATCGACG TCAGCCGGGG GGATATCCTG
GCCAGCCCGG ACACGGTGCC GACCGTCTCC GAGGCCTTCG ATGCGCGCAT TGTCTGGATG
GCCGAGCAGC CGATGCTGCC CGGGCGGCAA TACGACATCA AGCTGGGCAC GCGCCTGGTG
CCGGGCACGC CCACGGCGGT GCACCACCGG ATCGACGTGA ACTCGTTGGA GCACCAGCAC
GCGGAGGAGC TGGGGCTGAA CGAGATCGGT TACTGCCGGG TGACGCTGAA CCAGCCGGTG
CCCTTCGATG CCTACGAGCG GGTGGCGGAC ACCGGCAGCT TTATCCTCAT CGACCGGCTG
ACCAACGTGA CCGTGGGCGC GGGCATGATC GTGCGCCCGG CGGAGCGGGC GGTGCCGGAG
AAGTCCGACG ATGTGGTCTG GCAGGAGCCG AAGATCAGCA AGCAGCAGCG GGCCAACCAG
AAGACCCAGC GCCCGGCCAT CCTATGGTTC ACCGGGCTGT CCGGCTCGGG CAAGTCGACC
CTGTCCAACG CCCTGGAGCA GCGGCTCTAC CAGCTTGGCT ACCACAGCTA CCTGCTGGAC
GGCGACAACA TCCGCCATGG GCTTAACGGG GACTTGGGCT TCAGCCGCGA GGACCGGGTG
GAGAACATCC GCCGCATCGG CGAGGTGGCA CGGTTGTTCG TGGACGCGGG GCTGCTGGTG
GTGACCGCCT TCATCTCCCC CTTCCGCGCC GACCGGGCGA TGGTGCGCGA GCTGGTGGAG
GACGGTGAGT TCGTGGAGAT CTTCGTCGAC ACCCCGCTGG AGGTCTGCGA GCAGCGCGAC
CCCAAGGGGC TGTACGCCAA GGCCCGCGCC GGGGTGATCA AGGAGTTCAC CGGCATCGAC
TCGCCCTACG AGCCCCCGGA GAAGCCGGAG CTGCACATCC GCACCGCGGA GCTGTCCGTG
GACGAGAGCG TGGAGCGCAT CATCGCCTAT CTGCAGGACC GGCACATCTT GCGGTAG
 
Protein sequence
MSHASELIAQ DIEAYLKQHE QKDLLRFITC GSVDDGKSTL IGRLLHDSQL IYEDQLAAIK 
RDSGKYGTTG DEVDLALLVD GLQSEREQGI TIDVAYRYFS TEKRKFIIAD TPGHEQYTRN
MATGASTASL AIILVDARKG LQVQTRRHSF IVSLLGIRHV VLAVNKMDLV DYDQAVFDAI
CKDYQDFVKR LGIPDVHYVP VSALQGDNVV KPSTAMPWYD GPTMMHILET LPVAADRNFD
EFRLPVQYVN RPNPDFRGFC GTIASGVVHP GDPVVALPSG KRSRVARIVT WEGDLAEAFP
PQAVTLTLED EIDVSRGDIL ASPDTVPTVS EAFDARIVWM AEQPMLPGRQ YDIKLGTRLV
PGTPTAVHHR IDVNSLEHQH AEELGLNEIG YCRVTLNQPV PFDAYERVAD TGSFILIDRL
TNVTVGAGMI VRPAERAVPE KSDDVVWQEP KISKQQRANQ KTQRPAILWF TGLSGSGKST
LSNALEQRLY QLGYHSYLLD GDNIRHGLNG DLGFSREDRV ENIRRIGEVA RLFVDAGLLV
VTAFISPFRA DRAMVRELVE DGEFVEIFVD TPLEVCEQRD PKGLYAKARA GVIKEFTGID
SPYEPPEKPE LHIRTAELSV DESVERIIAY LQDRHILR