Gene SeHA_C4243 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeHA_C4243 
SymbolgppA 
ID6491077 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Heidelberg str. SL476 
KingdomBacteria 
Replicon accessionNC_011083 
Strand
Start bp4131716 
End bp4133197 
Gene Length1482 bp 
Protein Length493 aa 
Translation table11 
GC content56% 
IMG OID642744336 
Productguanosine pentaphosphate phosphohydrolase 
Protein accessionYP_002047934 
Protein GI194449328 
COG category[F] Nucleotide transport and metabolism
[P] Inorganic ion transport and metabolism 
COG ID[COG0248] Exopolyphosphatase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones103 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATTCCA CCTCGTTGTA TGCCGCCATT GATCTCGGTT CCAATAGTTT TCATATGCTG 
GTCGTGCGCG AGGCGGCGGG AAGCATCCAG ACGCTGACCC GAATAAAACG CAAGGTGCGT
CTGGCCGCGG GTCTGAACAA CGACAACCAC CTTTCAGCCG AAGCGATGGA ACGCGGCTGG
CAATGTCTGC GCCTGTTTGC TGAACGTTTG CAGGATATTC CGCAGCCGCA AATCCGCGTG
GTTGCCACCG CAACATTGCG TCTCGCCGTC AATGCGGGTG AATTTATCGC GAAAGCGCAG
ACTATCCTTG GTTGTCCGGT GCAGGTTATC AGCGGCGAAG AAGAGGCGCG GCTAATTTAT
CAGGGGGTCG CTCATACCAC CGGCGGCGCA GATCAGCGAC TGGTGGTGGA TATCGGCGGC
GCCAGCACTG AACTGGTTAC CGGCACTGGC GCGCAAACCA CGTCGCTGTT TAGCCTGTCG
ATGGGCTGCG TAACGTGGCT TGAACGCTAT TTTAGCGATC GTAATCTGGC GCAAGAAAAC
TTTGATGATG CGGAGAAAGC CGCGCGCGAT GTACTGCGTC CGGTCGCCGA TGAACTGCGT
TTTCATGGCT GGAAGGTCTG CGTGGGTGCC TCCGGCACCG TACAGGCATT GCAGGAAATC
ATGATGGCGC AGGGGATGGA CGAGCGCATT ACGCTCGCCA AACTGCAGCA GCTAAAACAA
CGCGCGATAC AGTGCGGGCG TCTGGAAGAG CTGGAAATCG AAGGCCTGAC GCTGGAGCGC
GCGCTGGTTT TCCCAAGCGG GCTGGCTATT CTGATCGCGA TATTTACCGA GCTGAACATC
CAGAGCATGA CGCTGGCAGG CGGCGCGTTA CGCGAAGGGC TGGTGTATGG GATGTTGCAT
CTGGCGGTAG ATCAGGATAT CCGCAGCCGC ACGCTGCGAA ACATTCAGCG TCGGTTTATC
GTCGATACCG AGCAGGCGAA CCGCGTAGCG AAGCTGGCAG ATAACTTCCT CAAACAGGTA
GAAAATGCCT GGCATATTGA GCCTATCAGT CGTGAACTGT TGCTTAGCGC TTGCCAGTTG
CATGAGATCG GTCTGAGCGT TGATTTTAAA CAGGCGCCCT ATCATGCCGC CTATTTAGTA
CGCCATTTGG ATCTGCCTGG CTATACGCCC GCGCAGAAAA AGTTGCTCGC CACCCTCTTA
CTGAATCAGA CCAATCCGGT CGATCTCTCT TCGCTTCATC AGCAAAACGC GGTACCGCCC
CGTGTTGCGG AACAGCTATG CCGTTTGCTG CGACTGGCGA TTCTTTTTGC CGGTCGCCGT
CGTGACGATC TGGTACCAGA AATTACGCTA CAGGCGCTAA ATGAAAATCT GACGTTAACC
TTGCCTGGCG ACTGGCTGGC ACATCACCCG CTGGGTAAAG AGTTGATTGA TCAGGAAAGC
CAGTGGCAAA GCTATGTACA CTGGCCGCTG GACGTTCGCT AA
 
Protein sequence
MNSTSLYAAI DLGSNSFHML VVREAAGSIQ TLTRIKRKVR LAAGLNNDNH LSAEAMERGW 
QCLRLFAERL QDIPQPQIRV VATATLRLAV NAGEFIAKAQ TILGCPVQVI SGEEEARLIY
QGVAHTTGGA DQRLVVDIGG ASTELVTGTG AQTTSLFSLS MGCVTWLERY FSDRNLAQEN
FDDAEKAARD VLRPVADELR FHGWKVCVGA SGTVQALQEI MMAQGMDERI TLAKLQQLKQ
RAIQCGRLEE LEIEGLTLER ALVFPSGLAI LIAIFTELNI QSMTLAGGAL REGLVYGMLH
LAVDQDIRSR TLRNIQRRFI VDTEQANRVA KLADNFLKQV ENAWHIEPIS RELLLSACQL
HEIGLSVDFK QAPYHAAYLV RHLDLPGYTP AQKKLLATLL LNQTNPVDLS SLHQQNAVPP
RVAEQLCRLL RLAILFAGRR RDDLVPEITL QALNENLTLT LPGDWLAHHP LGKELIDQES
QWQSYVHWPL DVR