Gene SNSL254_A4194 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A4194 
SymbolgppA 
ID6483778 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp4088119 
End bp4089600 
Gene Length1482 bp 
Protein Length493 aa 
Translation table11 
GC content56% 
IMG OID642739448 
Productguanosine pentaphosphate phosphohydrolase 
Protein accessionYP_002043151 
Protein GI194444292 
COG category[F] Nucleotide transport and metabolism
[P] Inorganic ion transport and metabolism 
COG ID[COG0248] Exopolyphosphatase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones96 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATTCCA CCTCGTTGTA TGCCGCCATT GATCTCGGTT CCAATAGTTT TCATATGCTG 
GTCGTGCGCG AGGCGGCGGG AAGCATCCAG ACGCTGACCC GAATAAAACG TAAGGTGCGT
CTGGCCGCGG GTCTGAACAA CGACAACCAC CTTTCAGCCG AAGCGATGGA ACGCGGCTGG
CAATGCCTGC GTCTGTTTGC TGAACGTTTG CAGGATATCC CGCAGCCGCA AATCCGCGTG
GTTGCCACCG CAACATTGCG TCTCGCCGTT AATGCGGGTG AATTTATCGC GAAAGCGCAG
ACTATCCTTG GTTGTCCGGT GCAGGTTATC AGCGGCGAAG AAGAGGCGCG GCTGATTTAT
CAGGGGGTCG CTCATACCAC CGGCGGCGCA GATCAGCGAC TGGTGGTGGA TATCGGCGGC
GCCAGCACTG AACTGGTTAC CGGCACTGGC GCGCAAACCA CGTCGCTATT TAGCCTGTCG
ATGGGCTGCG TAACGTGGCT TGAACGCTAT TTTAGCGATC GTAATCTGGC GCAAGAAAAC
TTTGATGATG CGGAGAAGGC CGCGCGCGAT GTACTGCGTC CGGTCGCCGA TGAACTGCGT
TTTCATGGCT GGAAGGTCTG CGTGGGTGCC TCCGGCACCG TACAGGCATT GCAGGAAATC
ATGATGGCGC AGGGGATGGA CGAGCGCATT ACGCTCGCCA AACTGCAGCA GCTAAAACAA
CGCGCGATAC AGTGCGGGCG TCTGGAAGAG CTGGAAATCG AAGGCCTGAC GCTGGAGCGC
GCGCTGGTTT TCCCAAGCGG GCTGGCTATT CTGATCGCGA TATTTACCGA GCTGAACATC
CAGAGCATGA CGCTGGCAGG CGGCGCGTTA CGCGAAGGGC TGGTGTATGG GATGTTGCAT
CTGGCGGTAG ATCAGGATAT CCGCAGCCGC ACGCTGCGAA ACATTCAGCG TCGGTTTATC
GTCGATACCG ATCAGGCGAA CCGCGTAGCG AAGCTGGCAG ATAACTTCCT CAAACAGGTA
GAAAATGCCT GGCATATTGA ACCTATCAGT CGTGAACTGT TGCTTAGCGC TTGCCAGTTG
CATGAGATCG GTCTGAGCGT TGATTTTAAA CAGGCGCCCT ATCATGCCGC CTATTTAGTA
CGCCATTTGG ATCTGCCTGG CTATACGCCC GCGCAGAAAA AGTTGCTCGC CACCCTCTTA
CTGAATCAGA CCAATCCGGT CGATCTCTCT TCGCTTCATC AGCAAAACGC GGTACCGCCC
CGTGTTGCGG AACAGCTATG CCGTTTGCTG CGACTGGCGA TTCTTTTTGC CGGTCGCCGT
CGTGACGATC TGGTACCAGA AATTACGCTA CAGGCGCTAA ATGAAAATCT GACGTTAACC
TTGCCTGGCG ACTGGCTGGC GCATCACCCG CTGGGTAAAG AGTTGATTGA TCAGGAAAGC
CAGTGGCAAA GCTATGTACA CTGGCCGCTG GACGTTCGCT AA
 
Protein sequence
MNSTSLYAAI DLGSNSFHML VVREAAGSIQ TLTRIKRKVR LAAGLNNDNH LSAEAMERGW 
QCLRLFAERL QDIPQPQIRV VATATLRLAV NAGEFIAKAQ TILGCPVQVI SGEEEARLIY
QGVAHTTGGA DQRLVVDIGG ASTELVTGTG AQTTSLFSLS MGCVTWLERY FSDRNLAQEN
FDDAEKAARD VLRPVADELR FHGWKVCVGA SGTVQALQEI MMAQGMDERI TLAKLQQLKQ
RAIQCGRLEE LEIEGLTLER ALVFPSGLAI LIAIFTELNI QSMTLAGGAL REGLVYGMLH
LAVDQDIRSR TLRNIQRRFI VDTDQANRVA KLADNFLKQV ENAWHIEPIS RELLLSACQL
HEIGLSVDFK QAPYHAAYLV RHLDLPGYTP AQKKLLATLL LNQTNPVDLS SLHQQNAVPP
RVAEQLCRLL RLAILFAGRR RDDLVPEITL QALNENLTLT LPGDWLAHHP LGKELIDQES
QWQSYVHWPL DVR