Gene Gura_4157 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGura_4157 
Symbol 
ID5166309 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter uraniireducens Rf4 
KingdomBacteria 
Replicon accessionNC_009483 
Strand
Start bp4816343 
End bp4817989 
Gene Length1647 bp 
Protein Length548 aa 
Translation table11 
GC content62% 
IMG OID640551635 
Productputative manganese-dependent inorganic pyrophosphatase 
Protein accessionYP_001232873 
Protein GI148266167 
COG category[C] Energy production and conversion 
COG ID[COG1227] Inorganic pyrophosphatase/exopolyphosphatase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAGA AAACCTATGT GATTGGCCAC CGCAACCCCG ATACCGACTC GATAGCCTCG 
GCCATCGCCT ATGCCGAACT GAAGCGGCTG TTGGGCGAAG ACGGGACCGT TGCCGCCATG
GCCGGCGAAC CGAATCCCCA GACCGCCTAT GTCCTGGAAC GGCTCGGCAT TGAGCCGCCG
CTCTACCTGG CCGACGTCCA CCCCAAGGTC AGGGATGTGC TGAACCGGAA GCCGGTGACC
GCCGGTGCCG ACCTGCCGAT GATGCAGGCG CTGGAGCTGT TCCACCGCAA CAATATCCGT
GTCCTGCCGG TGGTGGATGC TGCAGGGAAA CCGAGCGGCA TCGTCTCGCT GCTCAAGCTC
TCGGAGAAAT ACCTCCTGAC GGGCAAGGAC CGGATGCGGG GGGTGAACAC CTCGCTCCGC
TCGCTGGCCA TAAGCCTGGA AGGCGACTTC CTCTGCGGTG CCCCATGCGA TGACCATGAA
CACCTGCACC TCTTCATCGG CGCCATGGAA CAGGAATCAT TCAGCGCCCG GATCGACGGC
TATCCACCCG AATCGCTCCT GATCATCACC GGCGACCGGC GGACCATCCA ACTGGCCGGC
ATCGAAAAGG GGGTGCGGCT CCTGGTGGTG ACCGGCGGAC TGGCGGTGGA GAAGGACCTC
CTGACAAAGG CCGCAGAACT GGGCGTCACG GTCCTTTCGA CCCCTTTCGA TACCGCCACC
GCAGCCTGGC TGACCCGTCT CTCCACGCCG GTGGGGCTGT TCGCTGAAAC AAATTTCGAG
CGGATCGGCG TGGGCGAACC GCTCAGCCAC CTGAAACTGA AACTTCTCCA CAGCGGCGAG
CCGGCGGTGA TCGTCGTGGA GGAGGACGGC ACCCTGGCGG GGATCGCCAC CAAGTCGTCG
CTACTGGCGC CCATCCCCTC CTCCCTGATC CTGGTGGACC ACAATGAACT GGGGCAGGCG
GTTCACGGGG CGGATGAGCT GGAGATCCGC GAGGTCATCG ACCATCACAA GCTCGGGAAC
AGCCACACGA ACCAACCGGT TACCTTCATC ACCGCACCGG TGGGAAGCAC CTGCACGTTG
GTTGCCGGCC TCTACCGCCA TGAAGGGGTT GAACTGGCCC CCTCCTTTGC CGCCCTGCTT
CTGGCCGGCA TCCTCTCGGA TACGGTCATC CTTAAATCAC CGACCACCAC CAATCGGGAC
CGGGAAACGG TTCTCTGGCT TGAGCAACTG GCAGGGCTCG ACCATGGAGA GTTCGGCAAA
GAGATCTTTG CCGCCTGCTC GGGTCTTGCC GGCTACGGCA CCCCGGAAAA GGTCATCACC
ACCGACTTCA AGGTTTTCAC GGGGAGCTCC GCCCGCTTCG GGGTCGGACA GGTGGAGGTG
ATCGGCTTCG ACGAGTTTTT TGCCATGAGG GAGGAACTGA AGGGGGCGCT GGCGGAGCTG
AAGCAACGGG AAGGACTGGA TCTGGCAGGA CTGATGGTGA CCGACATCTA CACGGAAACG
ACCCTCTTTC TCGTGGAAGG AAAAAAGGAG TTGGCCCACG TCATGGGGTA CCCCCAGGTG
GAGGCGCACC TCTACGAGCT GAAGGGGGTC ATGTCGCGGA AAAAGCAGAT GGTGCCGCAC
CTCTTGCAGG TGTTGGAGAA GATCTAG
 
Protein sequence
MKKKTYVIGH RNPDTDSIAS AIAYAELKRL LGEDGTVAAM AGEPNPQTAY VLERLGIEPP 
LYLADVHPKV RDVLNRKPVT AGADLPMMQA LELFHRNNIR VLPVVDAAGK PSGIVSLLKL
SEKYLLTGKD RMRGVNTSLR SLAISLEGDF LCGAPCDDHE HLHLFIGAME QESFSARIDG
YPPESLLIIT GDRRTIQLAG IEKGVRLLVV TGGLAVEKDL LTKAAELGVT VLSTPFDTAT
AAWLTRLSTP VGLFAETNFE RIGVGEPLSH LKLKLLHSGE PAVIVVEEDG TLAGIATKSS
LLAPIPSSLI LVDHNELGQA VHGADELEIR EVIDHHKLGN SHTNQPVTFI TAPVGSTCTL
VAGLYRHEGV ELAPSFAALL LAGILSDTVI LKSPTTTNRD RETVLWLEQL AGLDHGEFGK
EIFAACSGLA GYGTPEKVIT TDFKVFTGSS ARFGVGQVEV IGFDEFFAMR EELKGALAEL
KQREGLDLAG LMVTDIYTET TLFLVEGKKE LAHVMGYPQV EAHLYELKGV MSRKKQMVPH
LLQVLEKI