Gene Gura_3828 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGura_3828 
Symbol 
ID5166363 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter uraniireducens Rf4 
KingdomBacteria 
Replicon accessionNC_009483 
Strand
Start bp4469047 
End bp4470894 
Gene Length1848 bp 
Protein Length615 aa 
Translation table11 
GC content60% 
IMG OID640551310 
Productsulfate adenylyltransferase, large subunit 
Protein accessionYP_001232551 
Protein GI148265845 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0529] Adenylylsulfate kinase and related kinases
[COG2895] GTPases - Sulfate adenylate transferase subunit 1 
TIGRFAM ID[TIGR00231] small GTP-binding protein domain
[TIGR00485] translation elongation factor TU
[TIGR02034] sulfate adenylyltransferase, large subunit 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAATA GCGTGAGCCC ATCAACAAAT GACCATCCAC CAACGACTAT CGACCATCCA 
CCATCGACCA TCCACCATCC ACCATCGACC GCCTTCACCG TAGTAATAGT CGGTCACGTG
GACCACGGCA AGTCCACCCT TCTCGGGCGC ATCTACGCGG ACACCGATTC CCTTCCCGAT
GGGCAGCTGG AGAAGGTCCG CGCCATCTGC GAGCAACAGG GAAAGGCATT TGAGTACGCC
TTCCTCTTTG ATGCATTTCT CGAAGAGCAG GAGCAGGGGA TTACCATCGA TACCGCCCGC
ACCTTTTTTT CCTGGGAAAA CCGGCAGTAT ATAATAATAG ATGCGCCGGG GCATAAGGAG
TTCCTGAAAA ACATGATTTC GGGCGCTGCC AGGGCCGAGG CGGCGGTGCT GATCATCGAT
GCTGCCGAAG GGGTGCGCGA ACAGTCGCGG CGCCACGGCT ACATGCTCAG TCTCCTGGGG
ATCCGGCAGA TTGCGGTGGT CGTCAACAAG ATGGACTTGG TCGGCTATGA CGAGCAGGTC
TTTAACGCCA TCGTCGAAGA GTACGGGGCA TTCCTGAAGG GGGTGGGCGT CTCGCCGCAG
CAGTTTATCC CAGCCAGCGC CAGGAACGGC GACAACGTGG TGCGGCGCAG TGATGCCATG
CCATGGTATC GGGGGTCTAC GGTATTGGAA AGCCTCGGTC TCTTTGAGAA GCTGCCGACC
GGCGAAGACC TGCCGCTCCG CTTTCCGGTG CAGGATGTCT ACAAGTTCGA CGCCCGCCGC
ATCATTGCCG GCCGGCTTGC TGCCGGAGTC CTGCGGGTCG GCGATACCGT GGTGTTTTCA
CCCTCCAATA AAACCGCCGT CGTCAAGACC ATCGAGGCCT TCAATGTTCC CCATCTCCCA
TTGGCGGCAG TTGCCGGCAA ATCGACCGGC TTTACCCTCG ACGAGCAGAT TTTCGTCGAA
CGGGGCGAGA TCGCCTCGTT AAAAGAGTCG GCGCCGGAGG TGTCGGACCG TTTTCGCGCC
AACCTGTTCT GGATGGGGAA GAACCCCCTT GTTCGCGGCC GCAGGTACGG CTTAAGGCTG
GCAACCTGTG CGGTGGAGAT GGAGCTGGAA ACGATCCATC GCATCATCGA TGCCGCCAGC
CTCGACCCGG TTGCGGTAAA GGACCGCGTT GACCTGAACG ACGTGGCGGA AGTGACCATC
AGGACCAGGA AGCCGATCGC CCTGGACTGC TATGCAGAAT GCGACGTCAC CGGCCGGTTT
GTGGTGGTGG ACGGTTATGA CGTGTGGGGC GGCGGCATAG TCACGGAAGT GCTGAGCGAC
GAGCAGGAAG CGTTCCGCAG GGAAGCGAGG CAGCGCGACA TCGCCTGGCG TCGCGGCGAG
GTTCAGCTCG GCGAACGGGT GCAGCGCAAC GGCCACAATC CCGGCATCAT CCTCTTCACC
GGCGAAAGCG GCACCGGCAA GGCGCGCCTT GCCCGACACC TGGAGCGGCG GCTTTTCAAC
AGCGGTCGGC AGGTGTATCT CCTTGACGGG AAAAACCTGC TATTTGGCCT GGGGCTAGAT
GTAACCGAGG AGGATAAGGA CGAAATGGTG CGCCGTTTCG GCGAAGTGGC GCAGATCCTG
CTCAGGGCGG GGCAGATCGT GGTTTCCACC ACCAATACCT TTGCCCGGGC CGATCACCAG
ATCATCCGCA CACTTGTCCA TCCCCACCAG GTCATTTCCG TGCATATGGC CTTTGCCGAG
GGGGAGGCGC CGGAGAATAC CGATCTGAAC TTTCTCGAGT CTGATGACCT GAAAGAGGCA
GCCAGGCGTA TCGTGGCGAA AATGGAGGAG ATGGGGGTGT TGCTGTGA
 
Protein sequence
MKNSVSPSTN DHPPTTIDHP PSTIHHPPST AFTVVIVGHV DHGKSTLLGR IYADTDSLPD 
GQLEKVRAIC EQQGKAFEYA FLFDAFLEEQ EQGITIDTAR TFFSWENRQY IIIDAPGHKE
FLKNMISGAA RAEAAVLIID AAEGVREQSR RHGYMLSLLG IRQIAVVVNK MDLVGYDEQV
FNAIVEEYGA FLKGVGVSPQ QFIPASARNG DNVVRRSDAM PWYRGSTVLE SLGLFEKLPT
GEDLPLRFPV QDVYKFDARR IIAGRLAAGV LRVGDTVVFS PSNKTAVVKT IEAFNVPHLP
LAAVAGKSTG FTLDEQIFVE RGEIASLKES APEVSDRFRA NLFWMGKNPL VRGRRYGLRL
ATCAVEMELE TIHRIIDAAS LDPVAVKDRV DLNDVAEVTI RTRKPIALDC YAECDVTGRF
VVVDGYDVWG GGIVTEVLSD EQEAFRREAR QRDIAWRRGE VQLGERVQRN GHNPGIILFT
GESGTGKARL ARHLERRLFN SGRQVYLLDG KNLLFGLGLD VTEEDKDEMV RRFGEVAQIL
LRAGQIVVST TNTFARADHQ IIRTLVHPHQ VISVHMAFAE GEAPENTDLN FLESDDLKEA
ARRIVAKMEE MGVLL