Gene GM21_1865 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_1865 
Symbol 
ID8137196 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp2169086 
End bp2170741 
Gene Length1656 bp 
Protein Length551 aa 
Translation table11 
GC content63% 
IMG OID644869476 
Productsulfate adenylyltransferase, large subunit 
Protein accessionYP_003021676 
Protein GI253700487 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG2895] GTPases - Sulfate adenylate transferase subunit 1 
TIGRFAM ID[TIGR00231] small GTP-binding protein domain
[TIGR02034] sulfate adenylyltransferase, large subunit 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones97 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCACACC AATCCGAACT GATCGAGAAG GATATTCTTG CCTACCTGAA GAGCCAGGAG 
GAGAAATCCT TGCTCCGTTT CATCACCTGC GGCAGCGTGG ATGACGGCAA AAGCACCCTG
ATCGGGCGGC TTTTGTGGGA CTCGAAGATG GTGTTCGAGG ACCAGTTGGC GGCGCTTGAG
GCGGACAGCA AGAAGGTGGG GACCCAAGGG GGCGCCATCG ACTATGCCCT GCTTTTGGAC
GGGCTGCAGG CTGAGCGGGA GCAGGGGATA ACCATCGACG TCGCCTATCG CTTCTTCTCC
ACCGACCGCC GCAAGTTCAT CGTCGCCGAC ACCCCGGGCC ACGAGCAGTA CACCCGCAAC
ATGGTGACCG GCGCCTCCAC CGCGAAGGTT GCGGTGATCC TGGTGGACGC CCGCAAGGGG
CTTTTGACCC AGACCCGCCG CCACAGCTAC CTGGTGTCGC TGGTGGGGAT CCGGCACATC
GTCTTGGCCA TCAACAAGAT GGACCTGGTC GACTTCGACG CCGGGAAATT CGCAGCCATC
GAGAGGGACT ACCGGGAATT CGCCGCTCCG CTCGGCTTTA GCTCCATCAC CGCACTCCCG
ATCTCGGCTT TGAACGGCGA CAACATCATC GAGAAGAGCG CCAACACCCC TTGGTATCAG
GGGCCGCCGC TATTGCACTT CCTGGAGACC GTGCAGGTCG AGGACGAACG CGCCGATCAG
GCCTTCCGGC TGCCGGTGCA ATGGGTGAAC CGTCCCAACC TCGATTTCCG CGGGTTCTGC
GGCACCGTCG CCTCCGGCGC CATACGTCCC GGCGACGAAA TCCGGGTCGC CTCTTCGGGG
CTGGTGAGCA AGGTCTCCAG GATCGTCACT ATGAACGGCG ATCTGGAAGA GGCTGTCGCC
GGCCAGGCAG TGACGCTCAC CTTGGAAGAC GAGATCGATA TCAGCCGCGG CGACATGCTG
ACCCGCTCCG ACGCGCCGCC GCTCTACACA CGGCATCCGG AGGCGCAGCT CGTCTGGCTC
CACGACGAGC CGCTGCAGCC CGGGCAGCTG TACCTGGTGA AGACTGCAAC CGGCGTGATT
CCCGGCAGGG TGACCGCTGT CCACTACGCG ACCGACGTCA ACACCCTGGA GCAGAAACAG
GTGGCGACCC TGGGGCTGAA CGAGATCGGC CTGGTGCGGC TGGAGCTGGA CCGCCCGGTC
TCCTTCGACC CGTACCGCGA GAACAGGGAC ACCGGCAGTT TCATCCTCAT CGACCGCTTC
ACCAATGCCA CCGTCGCGGC CGGCATGGTG GTCGAGGCTC TACCGCAGGA CGCAGCCTTG
TTGACCGAGG GAGGGGCAGG AACCGGAAGC TCCTGGGTTC GTCGCATCAG CCTCGGCGAG
TTGGCAGCTA CCAACCTGAA CCTTGTCGAC CTGAGAGAAG AGAAGGGCGC TTTCGTACTC
GACGTACCGC GAACCCTCCT CGAACACCTT GAGAAGGGGA ACCGGCTTCT TTTCAGGCTG
CGCGATCTGG GACAGTTGGA GCCGGTGGCG TTCATAGCCT ACGAGAACTG CCTCGCCTTC
GAGTTCGACC GGACGCCGGA AGGGATAAGC GTGCTGCTTT TCAAGAGGAG CAACAATCCC
CAGAAGAGCT ACGGGGACGA CGGGATAGGG ATCTAG
 
Protein sequence
MAHQSELIEK DILAYLKSQE EKSLLRFITC GSVDDGKSTL IGRLLWDSKM VFEDQLAALE 
ADSKKVGTQG GAIDYALLLD GLQAEREQGI TIDVAYRFFS TDRRKFIVAD TPGHEQYTRN
MVTGASTAKV AVILVDARKG LLTQTRRHSY LVSLVGIRHI VLAINKMDLV DFDAGKFAAI
ERDYREFAAP LGFSSITALP ISALNGDNII EKSANTPWYQ GPPLLHFLET VQVEDERADQ
AFRLPVQWVN RPNLDFRGFC GTVASGAIRP GDEIRVASSG LVSKVSRIVT MNGDLEEAVA
GQAVTLTLED EIDISRGDML TRSDAPPLYT RHPEAQLVWL HDEPLQPGQL YLVKTATGVI
PGRVTAVHYA TDVNTLEQKQ VATLGLNEIG LVRLELDRPV SFDPYRENRD TGSFILIDRF
TNATVAAGMV VEALPQDAAL LTEGGAGTGS SWVRRISLGE LAATNLNLVD LREEKGAFVL
DVPRTLLEHL EKGNRLLFRL RDLGQLEPVA FIAYENCLAF EFDRTPEGIS VLLFKRSNNP
QKSYGDDGIG I