Gene Mext_2232 details


Gene Information

Locus tag: Mext_2232
Symbol:
ID: 5831926
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium extorquens PA1
Kingdom: Bacteria
Replicon accession: NC_010172
Strand:
Start bp: 2479286
End bp: 2480695
Gene Length: 1410 bp
Protein Length: 469 aa
Translation table: 11
GC content: 69%
IMG OID: 641368031
Product: sulfate adenylyltransferase, large subunit
Protein accession: YP_001639698
Protein GI: 163851655
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG2895] GTPases - Sulfate adenylate transferase subunit 1
TIGRFAM ID: [TIGR00231] small GTP-binding protein domain
            [TIGR02034] sulfate adenylyltransferase, large subunit
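
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are illustrative and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Protein length in aa, assuming the CDS ends with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Each codon is 3 bp; the final (stop) codon encodes no residue.
    return gene_len_bp // 3 - 1


length_bp = gene_length(2479286, 2480695)
print(length_bp)                  # 1410
print(protein_length(length_bp))  # 469
```

Under these assumptions the computed values agree with the listed Gene Length (1410 bp) and Protein Length (469 aa).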


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value: 0.355056
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCATCC ATCAGTCTCC GGAAGCGTTC GGCTACGACG CCTTCCTGCG TCAGCACCAG 
AACAAGGAAG TCCTGCGCTT CATCACCTGC GGCTCGGTCG ATGACGGCAA GTCCACCCTG
ATCGGGCGGC TCCTGCACGA CACCAAGCAG ATCTTCGACG ATCAGGTGAC GGCGCTCCAG
CGCGATTCGC GCAAGCACGG CACGCAGGGC GGCGAGGTCG ATCTCGCCCT TCTGGTTGAC
GGACTCCAGG CCGAGCGCGA GCAGGGCATC ACCATCGATG TCGCCTACCG CTTCTTCTCG
ACCGACCGGC GCTCCTTCAT CGTCGCCGAC ACCCCCGGCC ACGAGCAGTA CACCCGCAAC
ATGGCGACCG GCGCCTCGAC CGCCGACCTC GCCGTGATCC TGGTGGACGC CCGCCACGGG
CTGACCCGCC AGAGCCGGCG CCACGCGCTG CTGGTCTCGC TGCTCGGCAT CCGCCGCGTC
GCGCTCGCCA TCAACAAGAT GGACCTCGTC GGCTGGTCGC AGGACAAGTT CGAGGCGATC
GTCTCCGGCT TCCAGGCCTT TGCCGCGCCG CTGAACTTCA CCGAGGTGCG GGCGATCCCG
CTCTCGGCCA AGAACGGCGA CAACGTCGTC CTGCCGGGCA CCGCCGCGAC CTGGTACACG
GACGTTCCGC TGCTGCGCTA TCTCGAAGAG GTGCCGGTGA AGTCGGAGGA GCGCGCCGCC
GCCTTCCGCA TGCCGGTGCA GTGGGTGAAC CGCCCGAATT CCGACTTCCG CGGCTTCTCG
GGGCTGATCG CCTCGGGCTC CGTCGCGCCG GGCGATGCCG TCACCGTCGC GCCTTCCGGC
AAGACCTCGA CGATCGCCCG CATCTTCACC GCCGACGGCG ATCTGGAACG GGCGAGCGAG
GGCCAGTCGG TGACGCTGGT GCTGGCCGAC GAAGTCGATG CCTCGCGCGG CGCGGTGATC
GCGACCTCGG ACGCACCGTT GACGCTGACC GACAGCCTCG ACGTGCGCCT GTTCTGGGCC
GCCGAATCCG ATCTCGTTCC CGGCGCCAAC CTGTGGGCGA AGGTCGGCAC GCAGACCGTC
AACGCGGTGG TGAAGGCGGT GCACCGCCGG ATCGATCCGG AGACGGGACA GGCCGGTCCG
GCCGACAAGC TCGCGGTCAA CGACATCGGC GACGTGACGC TGACCCTCGA CCGGCAGATC
GCGGTCGATC CCTATGCCGA GAACCGCGAC ACCGGCAGCC TGATCCTGAT CGACCGTGAG
ACGACCGACA CGGCCGCGCT CGGCCTCGTG CAGAGGGTCG TTGCGTCGAG CAAGGTCGCT
CCGGCGCCGA CCGCGTCTGT GACGGCTTCG GCGGAGCCCG CACGTAGCGG CGGTTTGCTG
GCCGGCCTCA AGCGGCTGTT CGGCGGATAA
 
Protein sequence
MTIHQSPEAF GYDAFLRQHQ NKEVLRFITC GSVDDGKSTL IGRLLHDTKQ IFDDQVTALQ 
RDSRKHGTQG GEVDLALLVD GLQAEREQGI TIDVAYRFFS TDRRSFIVAD TPGHEQYTRN
MATGASTADL AVILVDARHG LTRQSRRHAL LVSLLGIRRV ALAINKMDLV GWSQDKFEAI
VSGFQAFAAP LNFTEVRAIP LSAKNGDNVV LPGTAATWYT DVPLLRYLEE VPVKSEERAA
AFRMPVQWVN RPNSDFRGFS GLIASGSVAP GDAVTVAPSG KTSTIARIFT ADGDLERASE
GQSVTLVLAD EVDASRGAVI ATSDAPLTLT DSLDVRLFWA AESDLVPGAN LWAKVGTQTV
NAVVKAVHRR IDPETGQAGP ADKLAVNDIG DVTLTLDRQI AVDPYAENRD TGSLILIDRE
TTDTAALGLV QRVVASSKVA PAPTASVTAS AEPARSGGLL AGLKRLFGG
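
The gene and protein sequences are printed in space-separated blocks of 10 characters. The sketch below shows one way to strip that layout, compute GC content, and translate the nucleotide sequence so it can be compared with the protein sequence. It is an illustration under stated assumptions, not the database's own method: the helper names (`clean`, `gc_percent`, `translate`) are hypothetical, and the codon table is built from the standard amino-acid assignments, which translation table 11 shares for all 64 codons (table 11 differs in the set of permitted start codons, which this sketch does not model).

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for 10-character blocks."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence listed above (60 bases).
first_line = "ATGACCATCC ATCAGTCTCC GGAAGCGTTC GGCTACGACG CCTTCCTGCG TCAGCACCAG"
print(translate(first_line))  # MTIHQSPEAFGYDAFLRQHQ
```

The translated fragment matches the first 20 residues of the protein sequence. Passing the full gene sequence to `gc_percent` and `translate` should allow the same kind of check against the listed GC content and the complete protein.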