Gene M446_5479 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_5479 
Symbol 
ID6129636 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp6009709 
End bp6012105 
Gene Length2397 bp 
Protein Length798 aa 
Translation table11 
GC content72% 
IMG OID641645613 
Productsulfate adenylyltransferase, large subunit 
Protein accessionYP_001772229 
Protein GI170743574 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1613] ABC-type sulfate transport system, periplasmic component
[COG2895] GTPases - Sulfate adenylate transferase subunit 1 
TIGRFAM ID[TIGR00231] small GTP-binding protein domain
[TIGR00971] sulfate/thiosulfate-binding protein
[TIGR02034] sulfate adenylyltransferase, large subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0197363 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00943774 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGACGGTGC ACCAGTCCAC CCGCGCCTTC GGCTACGAGG CCTTCCTCGC CGCCCACCAG 
CGCAAGGAGG TGCTGCGCTT CATCGCCTGC GGCTCCGTCG ACGACGGCAA GTCGACCCTG
ATCGGGCGCC TGCTGCACGA CACCAAGCAG ATCTTCGACG ACCAGGTGAC CGCGCTGGAG
CGCGATTCCC GCCGCCACGG CACGCGCGGC GGCGAGATCG ACCTCGCGCT CTTGGTCGAC
GGCCTGCAGG CGGAGCGCGA GCAGGGCATC ACCATCGACG TCGCCTACCG GTTCTTCTCG
ACCGAGCGGC GCTCCTTCAT CGTCGCCGAC ACGCCCGGCC ACGAGCAGTA CACCCGCAAC
ATGGCGACCG GCGCCTCGAC GGCCGACGTC GCGGTGCTGC TCGTCGATGC CCGCAAGGGC
CTGAGCCGGC AGACGCGGCG CCACGCGCTC CTGGTCTCGA TGCTGGGCAT CCGGCGCGTC
GTGCTCGCCG TCAACAAGAT GGACCTGATC GGCTGGTCGG AGACCCGCTT CGAGGCGATC
GCGGGCGAGT TCCGCGCCTT CGCGGCCCCG CTCGGCTTCG CCGACGTGAC GGCGATCCCG
CTCTCGGCCG CGAACGGCGA CAACGTCGTG CTGCCGGGCG CCGCCGCGCC CTGGTATGCC
GGCCCGCCGC TGCTGCAGCA CCTGGAGGAG GTGCCGGCCC ACGCGGAGGA GGAGGCCGCC
CCCTTCCGCA TGGCGGTGCA GTGGGTGAAC CGGCCGAACC CCGATTTCCG CGGCTTCTCC
GGAATGATCG CCTCGGGCCG GGTGGCGCCG GGCGACGCCG TCGCGCTGCT GCCCTCGGGC
CAAGCCTCGA CCATCGCGCG GATCTTCACG GCGGACGGCG ACCTCGACGA GGCCGTGGCC
GGCCAGTCGG TGACGCTGGT CCTCGCCGAC GAGCGCGACG CCTCGCGCGG CAGCGTGATC
GCGGCGGCGG GCGCCCCGCC GCGGGTCGCC GACCGGCTCG ACGTGCGCCT GTTCTGGGCG
CGGGAGAGCG AGGTCGCGGC GGGCGCGACC CTCATCGCCA AGATCGGCAC CGCGACGGCG
AACGCGACCG TCGAGCGCAT CGTCTCGCGC ATCGATCCCG AGACCGGCCT GTCGGAGCCC
GCCGAGCGGC TCGCCGTGAA CGACATCGGC GACGTGGTGC TCAGCCTCGA CCGGCCGGTG
GCGGTGGACG CCTACCGGGA GAACCGCGAC ACCGGCAGCC TGATCCTGAT CGACCGCGAC
TCCACCGACA CGGCCGCCCT CGGCCTCGTC CAAGTCCCGG CCCAAGTTTC AGCCCAAGGC
CCGGCGCGAG ACCCGGTCGC CGCCCGCCGC GGCGGGTTCC TCTCCGGCCT GCGCCGGCTG
TTCGGGGCGA AGGGGCTCGC GCTCCTCGCC GGGGCGGCCC TCCTCGGCCT CGCCGCGCCC
GCGCGGGTCG AGGCGCAGGC GGTGCTGCTC AACGTCTCGT ACGATCCGAC CCGGGAGCTC
TACCGGGCGA TCGACGCCGC CTTCGCGGCC GAGTGGAAGC AGAAGACCGG CGAGAGCGTG
ACCGTGCGCG CCTCCCACGG CGGCTCGGGC GCGCAGGCCC GGGCGGTGAT CGACGGGCTC
CCCGCCGACG TCGTGACGCT GGCCCTCGCC AGCGACATCG ACGCGATCGC GGCCCGCACC
GGGAAGATCC CGGCGGACTG GCAGAAGCGC CTGCCCCACA ACGCGACGCC CTATACGTCG
ACCATCGTGT TCCTGGTGCG CAAGGGCAAC CCGAAAGCCA TCAAGGACTG GAACGACCTG
GTGAGGCCGG GGATCCAGGT GATCACCCCG AACCCGAAGA CCTCGGGCGG CGCGCGCTGG
AACTACCTCG CGGCCTACGC CTACGGGCTC GCCCAGAACG GCAACGACGA CGCTAGGGCC
AAGGCCTTCG TGGCGGCCCT GTTCAAGAAC GTGCCGGTCC TCGATACCGG CGCGCGCGGC
GCCACGACGA CCTTCGTGCA GCGCGGCCTC GGCGACGTCC TGATCGCCTG GGAGAACGAG
GCCTTCCTGG CGGACGAGGA GTTCGGGAAG GGCAAGTTCG ACATCGTCGT CCCCTCGCTC
TCGATCCTGG CCGAGCCGCC GGTGGCGCTC GTCGACGGCA ACGTCGACCA GAAGGGCACC
CGCCGCCAGG CCGAGGCCTA CCTGCAATTC CTGTACGGCA AACAGGCCCA GGCCCTCATC
GCCAAGAACT TCTACCGCCC GCGCGACGAA TCCGCCGCCG CCAAGGAGGA CCTCGCCCGC
TTCCCGAAGC TGAAGCTCGT CACCATCGAC GACACCTTCG GCGGCTGGGG CAAGGCGCAG
AAGACCCATT TCGACGATGG CGGCGTCTTC GACGCAATCC TGAAGGCGCG GCAGTGA
 
Protein sequence
MTVHQSTRAF GYEAFLAAHQ RKEVLRFIAC GSVDDGKSTL IGRLLHDTKQ IFDDQVTALE 
RDSRRHGTRG GEIDLALLVD GLQAEREQGI TIDVAYRFFS TERRSFIVAD TPGHEQYTRN
MATGASTADV AVLLVDARKG LSRQTRRHAL LVSMLGIRRV VLAVNKMDLI GWSETRFEAI
AGEFRAFAAP LGFADVTAIP LSAANGDNVV LPGAAAPWYA GPPLLQHLEE VPAHAEEEAA
PFRMAVQWVN RPNPDFRGFS GMIASGRVAP GDAVALLPSG QASTIARIFT ADGDLDEAVA
GQSVTLVLAD ERDASRGSVI AAAGAPPRVA DRLDVRLFWA RESEVAAGAT LIAKIGTATA
NATVERIVSR IDPETGLSEP AERLAVNDIG DVVLSLDRPV AVDAYRENRD TGSLILIDRD
STDTAALGLV QVPAQVSAQG PARDPVAARR GGFLSGLRRL FGAKGLALLA GAALLGLAAP
ARVEAQAVLL NVSYDPTREL YRAIDAAFAA EWKQKTGESV TVRASHGGSG AQARAVIDGL
PADVVTLALA SDIDAIAART GKIPADWQKR LPHNATPYTS TIVFLVRKGN PKAIKDWNDL
VRPGIQVITP NPKTSGGARW NYLAAYAYGL AQNGNDDARA KAFVAALFKN VPVLDTGARG
ATTTFVQRGL GDVLIAWENE AFLADEEFGK GKFDIVVPSL SILAEPPVAL VDGNVDQKGT
RRQAEAYLQF LYGKQAQALI AKNFYRPRDE SAAAKEDLAR FPKLKLVTID DTFGGWGKAQ
KTHFDDGGVF DAILKARQ