Gene SeSA_A4125 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeSA_A4125 
SymbolgppA 
ID6515780 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Schwarzengrund str. CVM19633 
KingdomBacteria 
Replicon accessionNC_011094 
Strand
Start bp4000487 
End bp4001968 
Gene Length1482 bp 
Protein Length493 aa 
Translation table11 
GC content56% 
IMG OID642749093 
Productguanosine pentaphosphate phosphohydrolase 
Protein accessionYP_002116849 
Protein GI194737882 
COG category[F] Nucleotide transport and metabolism
[P] Inorganic ion transport and metabolism 
COG ID[COG0248] Exopolyphosphatase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.353078 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATTCCA CCTCGTTGTA TGCCGCTATT GATCTCGGTT CCAATAGTTT TCATATGCTG 
GTCGTGCGCG AGGCGGCGGG AAGCATCCAG ACGCTGACCC GAATAAAACG CAAGGTGCGT
CTGGCCGCGG GTCTGAACAA CGACAACCAC CTTTCAGCCG AAGCGATGGA ACGCGGCTGG
CAATGCCTGC GTCTGTTTGC TGAACGTTTG CAGGATATTC CGCAGCCGCA AATCCGCGTG
GTTGCCACCG CAACATTGCG TCTCGCCGTC AATGCAGGGG AATTTATCGC GAAAGCGCAG
ACTATCCTTG GTTGTCCGGT GCAGGTTATC AGCGGCGAAG AAGAGGCGCG GCTAATTTAT
CAGGGGGTCG CTCATACCAC CGGCGGCGCA GATCAGCGGC TGGTGGTGGA TATCGGCGGC
GCCAGCACTG AACTGGTTAC CGGCACTGGC GCGCAAACCA CGTCGCTGTT TAGCCTGTCG
ATGGGCTGCG TAACGTGGCT TGAACGCTAT TTTAGCGATC GTAATCTGGC GCAAGAAAAC
TTTGATGATG CGGAGAAGGC CGCGCGCGAT GTACTGCGTC CGGTCGCCGA TGAACTGCGT
TTTCATGGCT GGAAGGTCTG CGTGGGTGCC TCCGGCACCG TACAGGCATT GCAGGAAATC
ATGATGGCGC AGGGGATGGA CGAGCGCATT ACGCTCGCCA AACTGCAGCA GCTAAAACAA
CGCGCGATAC AGTGCGGGCG TCTGGAAGAG CTGGAAATCG AAGGCCTGAC GCTGGAACGC
GCGCTGGTTT TCCCAAGTGG GCTGGCTATT CTGATCGCGA TATTTACCGA GCTGAACATC
CAGAGCATGA CGCTGGCAGG CGGCGCGTTA CGCGAAGGGC TGGTGTATGG GATGTTGCAT
CTGGCGGTAG ATCAGGATAT CCGCAGCCGC ACGCTGCGAA ACATTCAGCG TCGGTTTATC
GTCGATACCG ATCAGGCGAA CCGCGTAGCG AAGCTGGCAG ATAACTTCCT CAAACAGGTA
GAAAATGCTT GGCATATTGA GCCTATCAGT CGTGAACTGT TGCTTAGCGC TTGCCAGTTG
CATGAGATCG GTCTGAGCGT TGATTTTAAA CAGGCGCCCT ATCATGCCGC CTATTTAGTA
CGCCATTTGG ATCTGCCTGG CTATACGCCC GCGCAGAAAA AGTTGCTCGC CACCCTCTTA
CTGAATCAGA CCAATCCGGT CGATCTCTCT TCGCTTCATC AGCAAAACGC GGTACCGCCC
CGTGTTGCGG AACAGCTATG CCGTTTGCTG CGCCTGGCGA TTCTTTTTGC CGGTCGCCGT
CGTGACGATC TGGTACCAGA AATTACGCTA CAGGCGCTAA ATGAAAATCT GACGTTAACC
TTGCCTGGCG ACTGGCTGGC GCATCACCCG CTGGGTAAAG AGTTGATTGA TCAGGAAAGC
CAGTGGCAAA GCTATGTACA CTGGCCGCTG GACGTTCGCT AA
 
Protein sequence
MNSTSLYAAI DLGSNSFHML VVREAAGSIQ TLTRIKRKVR LAAGLNNDNH LSAEAMERGW 
QCLRLFAERL QDIPQPQIRV VATATLRLAV NAGEFIAKAQ TILGCPVQVI SGEEEARLIY
QGVAHTTGGA DQRLVVDIGG ASTELVTGTG AQTTSLFSLS MGCVTWLERY FSDRNLAQEN
FDDAEKAARD VLRPVADELR FHGWKVCVGA SGTVQALQEI MMAQGMDERI TLAKLQQLKQ
RAIQCGRLEE LEIEGLTLER ALVFPSGLAI LIAIFTELNI QSMTLAGGAL REGLVYGMLH
LAVDQDIRSR TLRNIQRRFI VDTDQANRVA KLADNFLKQV ENAWHIEPIS RELLLSACQL
HEIGLSVDFK QAPYHAAYLV RHLDLPGYTP AQKKLLATLL LNQTNPVDLS SLHQQNAVPP
RVAEQLCRLL RLAILFAGRR RDDLVPEITL QALNENLTLT LPGDWLAHHP LGKELIDQES
QWQSYVHWPL DVR