Gene GSU2559 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU2559 
Symbol 
ID2685512 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp2821751 
End bp2823292 
Gene Length1542 bp 
Protein Length513 aa 
Translation table11 
GC content66% 
IMG OID637127249 
Productexopolyphosphatase, putative 
Protein accessionNP_953605 
Protein GI39997654 
COG category[F] Nucleotide transport and metabolism
[P] Inorganic ion transport and metabolism 
COG ID[COG0248] Exopolyphosphatase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGACCCACA ACCGGCTGGC CGCCATCGAT ATCGGGACCA ACTCCATCCG CTGCATCGTG 
GTGGAGGTGA CCAGGAACGG CAAATTTCGC GTTCTGGACG ACGAGAAGGC CACCGTGCGC
CTGGGCGAAG GGATGGCCGC CAGCGGCACC ATCTCGCCGG CGGCCTGGGA GCGTGCCGTT
ACGGCGCTTG GCCGGATGAA GAAGATCGTG GACGGCTACG GGGTCAAGGT GGTGGAGGCG
GTTGCCACCA GCGCCGTGCG CCGTGCCGCA AACGGCGAGG AGTTCATCCG GACCGTGGAG
GAGACGGTCG GGGTCAGGGT GGCGGTGATC AGCGGCGAGG AGGAGGCGGA ACTGGCCGCC
CTCTCGGTGC GGAACCATTT CGACATGGAG GGGGTCCGCT ACGCCATGGT GGACATCGGC
GGCGGCAGCC TTGAAATCGT CACGGCGCTC GGGACCCATA TCGAGGATAT CCACTCCCTG
GAGCTCGGGG CCGTGGTCCT GACCGAACGC TTCGTCCGGA GCGATCCCCC GCGCCAGGCT
GACCTGGACA GGCTGCGCAA ACACGTGCGC GCGTCGCTGA AGGAGTCACT GGGCGCCGAG
TGGGGACACC TCCAGAGCCT GGTGGGCTCC GGCGGCACCA TCACGTCTAT CGCCGCCATG
GTCATGGCCA TGCGGGGGGA GGGGTACGGA TCGGTCCACC GTTACGAGGT GCTCCGGTCG
GAGGTGGTGC ATCTGCTGGC CATGCTCTCC CGCAAGGACC TCAAGGCCCG CCGGGAGGTG
CCGGGGCTCA ACCCGGACCG GGCCGACATC ATCACTGCCG GTGTCACTGT GGTGGACGAG
CTCATGAGGT TCTTCGACGT GAACCTCTTG CGGGTCAATG AGCGGGGAAT CCGGGAAGGG
CTCATCATCA AGGCCCTGCG GACCCACGGC CTGATTCCCG GCATGGAAAC TCCCCTCACC
TGGCGCGAGT CGGTGCTGGA GTTCGCCCGT TCATGCCATG CGGACGAAGA GCATGCCCTC
CAGGTGGCGC GGCTTTCCCT GGAGATATTC GATTCCCTGG AGCCGGTCTA TGGCATGGGC
GAGGGAGCCC GCCGGATACT GGAGGCGGCG GCCATCCTCC ACGACGTGGG CTACTTCATC
AACTATTCAA GCCACCACAA GCATTCATAC CATTTGATCC GCCACGCGGA CCTCTTCGGC
TTCACCCCCC GGGAGCGGGA GCTGATCGCC TCAGCGGCCC GCTACCACCG CAAGGCCCTT
CCCAAGAAGA AGCACGAGTC GTACATGCGT CTGGCGGAGC CGGACCGGCT CCTGGTGGCG
CGCCTGGGCG GCATCCTGCG GCTCGCCGAC GGCCTCGACC GTCGCCGCAA CAGCCTGGTC
TCCGGCCTTA CCTGTTCCCT TTCCGACGGC ACCTTCATCC TTACCCTTGC CAGTCGCGGA
GAAATCTCCG TGGAGCTCTT CGGCGGCAAG ATCAAGGGAG ATCTCTTCGA GGAGGCCTTC
AGTAAGCGCC TCCTGCTGGT GGCCGAAAGC GCCCTCGCCT GA
 
Protein sequence
MTHNRLAAID IGTNSIRCIV VEVTRNGKFR VLDDEKATVR LGEGMAASGT ISPAAWERAV 
TALGRMKKIV DGYGVKVVEA VATSAVRRAA NGEEFIRTVE ETVGVRVAVI SGEEEAELAA
LSVRNHFDME GVRYAMVDIG GGSLEIVTAL GTHIEDIHSL ELGAVVLTER FVRSDPPRQA
DLDRLRKHVR ASLKESLGAE WGHLQSLVGS GGTITSIAAM VMAMRGEGYG SVHRYEVLRS
EVVHLLAMLS RKDLKARREV PGLNPDRADI ITAGVTVVDE LMRFFDVNLL RVNERGIREG
LIIKALRTHG LIPGMETPLT WRESVLEFAR SCHADEEHAL QVARLSLEIF DSLEPVYGMG
EGARRILEAA AILHDVGYFI NYSSHHKHSY HLIRHADLFG FTPRERELIA SAARYHRKAL
PKKKHESYMR LAEPDRLLVA RLGGILRLAD GLDRRRNSLV SGLTCSLSDG TFILTLASRG
EISVELFGGK IKGDLFEEAF SKRLLLVAES ALA