Gene GSU2975 details


Gene Information

Locus tag: GSU2975
Symbol:
ID: 2687072
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sulfurreducens PCA
Kingdom: Bacteria
Replicon accession: NC_002939
Strand:
Start bp: 3267264
End bp: 3268916
Gene Length: 1653 bp
Protein Length: 550 aa
Translation table: 11
GC content: 65%
IMG OID: 637127668
Product: putative manganese-dependent inorganic pyrophosphatase
Protein accession: NP_954017
Protein GI: 39998066
COG category: [C] Energy production and conversion
COG ID: [COG1227] Inorganic pyrophosphatase/exopolyphosphatase
TIGRFAM ID:
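
The start, end, gene length and protein length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon. The function names are hypothetical and not part of the database.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3267264
END_BP = 3268916
GENE_LENGTH_BP = 1653
PROTEIN_LENGTH_AA = 550


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS: one per codon, minus the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks pass for the values listed in this record.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```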


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGAAGC AGATTTACGT CATCGGGCAC CGCAATCCCG ACACCGATTC CATTGCGTCG 
GCCATTGCCT ACGCCCAATT CAAGAAGAAG CAGGGGGTGG CGAACGTGAC CGCCGCCATG
GCCGGCCAGC CGAACCCCCA AACCCGCTAC ATCCTGGAGC GGCTCGGGAT CGAGCCGCCG
GTCTACCTGG CCGATGTAAA TCCCAAGGTG CGCGACGTGC TGAACCGCCG CCCGGTCACC
GCCCGGCCGG AGGTCGCCCT CAGGGACGCT CTGGGGCTCT TTCACCGCCA CGGGATTCGT
GTTCTGCCGG TGGTCGACGC CGAGGGAACC CCGGTGGGGG TGGTTTCGCT TCTGAGGCTG
TCGGAAAAGC ACTTGGTGGC CGGCACCGAC CGCAGGCGGG GTGTCGACAC CTCACTGCGC
TCCCTTGCCG CCTGCCTCGA CGGAACCTTC CTTTCCGGCG GGCCTGCCGA CGAGGTGGAG
CACCTGCACC TGTTCATCGG CGCCATGCTG GAAGAATCCT TTTCCAGCCG GATCGAGGGG
TATGACCCGG CAACGCTCCT GATCATGACC GGCGACCGGC GGAGTATCCA CCAGGCCGCC
ATCGAGCGGG GTGTGCGCCT GCTGGTGGTG ACCGGCGGGC TCCCCATTGC CGACGAGCTG
GTGGCCCGCG CCCGGGAGAA GGGCGTAGTC GTCCTTTCAA CCCCCCATGA TACCGCCACC
GCCGCCTGGC TGGCACGGCT CGCCTCTCCT CTCTCCCTGT TCATGGAGCC CGGCTTCGAA
CGGATCGGCG TGGCCGAGCC ACTGGAGCAC CTGCGGCTCA AGCTCCTCCA TAGTCAGGAG
CCGGCGGTCA TTGCGGTGGA GGAAGACGGC ACCATCGCCG GGGTGGCCAC CAAGTCGTCC
CTGCTGGCGC CGGTCCCCTA CGCATTGATC CTCATGGATC ACAATGAGCT GAGTCAGGCG
GTGCCCGGCG CAGAAACAGT GGATATCCTC GAGGTCATCG ACCATCACAA GCTCGGCAAT
CCGCCCACCA ATCAACCCAT CACCTTCATG GCGGCGCCGG TGGGGAGCAC CTGCACCGTG
GTTGCCTCCC TCTACCGCGA GGCCGGGATC GAGCCGGGCG AGCGGACCGC GGCCCTGCTG
CTTGCCGGCA TCCTCACGGA TACGGTGATC CTCAAATCTC CCACCAGCAC CGTCCGGGAC
CGTGAGATGA TCGCCTGGCT AGAGGAACGG TCCGGGCTGG AACATCTTGC CTTTGGCAAG
GAGATCTTCT CCGCCTGCGG CGGATTTGCC TCCCATGGTA CGCCGGAGCA GGCCCTGCGC
TCCGATTTCA AGCAGTTCAC CGCTGGCGGC ATGCAGTTCG GCGTGGGGCA GGTGGAGGTG
GTGGGCTTCG ACGAGTTTTT CGAGCTGAAG GATGCCCTGC GCGACTGTCT CCGGCGGGTG
AAGGAGGTCG ACCGCCTCGA CCTGGCCGGC CTCATGGTGA CCGACATCTA TACCGAAACC
ACGCTGTTCC TGGCCGAGGG GAAGAACGAG ATCGCCCACG TGATGGGGTA TCCCCAAGTG
GAGCCTCACC TCTATGAGCT CAAGGGGGTC ATGTCCCGCA AGAAGCAGAT GGTTCCCCAC
TTGCTCGGGG TGCTCGGGAA GGTGCAGGCA TGA
 
Protein sequence
MKKQIYVIGH RNPDTDSIAS AIAYAQFKKK QGVANVTAAM AGQPNPQTRY ILERLGIEPP 
VYLADVNPKV RDVLNRRPVT ARPEVALRDA LGLFHRHGIR VLPVVDAEGT PVGVVSLLRL
SEKHLVAGTD RRRGVDTSLR SLAACLDGTF LSGGPADEVE HLHLFIGAML EESFSSRIEG
YDPATLLIMT GDRRSIHQAA IERGVRLLVV TGGLPIADEL VARAREKGVV VLSTPHDTAT
AAWLARLASP LSLFMEPGFE RIGVAEPLEH LRLKLLHSQE PAVIAVEEDG TIAGVATKSS
LLAPVPYALI LMDHNELSQA VPGAETVDIL EVIDHHKLGN PPTNQPITFM AAPVGSTCTV
VASLYREAGI EPGERTAALL LAGILTDTVI LKSPTSTVRD REMIAWLEER SGLEHLAFGK
EIFSACGGFA SHGTPEQALR SDFKQFTAGG MQFGVGQVEV VGFDEFFELK DALRDCLRRV
KEVDRLDLAG LMVTDIYTET TLFLAEGKNE IAHVMGYPQV EPHLYELKGV MSRKKQMVPH
LLGVLGKVQA
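
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The following is a minimal sketch, not the database's own tooling. It uses the standard codon assignments, which translation table 11 shares for elongation codons, and it does not handle the alternative start codons that table 11 permits (this gene starts with ATG). The helper names `clean`, `translate` and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon table built in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last lines of the gene sequence, as listed in this record.
print(translate("ATGAAGAAGC AGATTTACGT CATCGGGCAC"))
print(translate("TTGCTCGGGG TGCTCGGGAA GGTGCAGGCA TGA"))
```

Passing the full gene sequence to `translate` and `gc_fraction` allows the listed protein sequence and GC content to be checked in the same way.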