Gene SeD_A2871 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A2871 
Symbol 
ID6873517 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp2745905 
End bp2747485 
Gene Length1581 bp 
Protein Length526 aa 
Translation table11 
GC content54% 
IMG OID642785922 
Productexopolyphosphatase 
Protein accessionYP_002216572 
Protein GI198244636 
COG category[F] Nucleotide transport and metabolism
[P] Inorganic ion transport and metabolism 
COG ID[COG0248] Exopolyphosphatase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones81 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTACAT CAAATCACTC GAGCAACCAG ACTAACGCTA TGCCAATTTA CGATAAATCC 
CCTCGTCCGC AGGAGTTCGC TGCGGTCGAT CTCGGCTCAA ACAGCTTTCA TATGGTCATT
GCCCGCGTGG TTGACGGCGC AATGCAGATT ATCGGGCGTT TAAAACAGCG CGTCCATCTG
GCGGACGGGC TGGGCGCAGA TAATAAACTC AGCGAAGAAG CCATGGAACG GGGGCTTAGC
TGTCTGTCGC TGTTTGCTGA ACGCTTACAA GGTTTCTCCC CTTCCAGCGT CTGTATCGTA
GGCACCCATA CGTTACGTCA GGCGCAAAAT GCCGCTGATT TTCTCAAACG CGCGGAAAAG
GTTATTCCCT ACCCGATAGA GATTATTTCC GGTAACGAAG AAGCGCGCCT GATTTTTATG
GGCGTAGAAC ATACGCAGCC GGAAAAAGGC CGCAAGCTGG TGATCGATAT CGGCGGCGGG
TCAACAGAGC TGGTCATTGG CGAAAACTTC GAACCCAGGC TGGTTGAAAG CCGTCGTATG
GGCTGCGTGA GCTTCGCGCA GCTCTACTTT CCCGGCGGCG TTATCAATAA AGAAAACTTC
CAGCGCGCCC GAATGGCGGC GGCGCAAAAA CTGGAAACCT TAACCTGGCA GTATCGTATT
CAGGGTTGGA ACGTAGCGAT GGGCGCTTCC GGTACGATTA AGGCCGCTCA TGAAGTTCTC
CTGGCGCTGG GTGAGAAAGA TGGCTTCATT ACGCCGGAGC GCCTCGATAA ACTGAAGTCA
GAAGTGTTGA AGCACCGCTC CTTTAATGCG CTCAGCCTGC CGGGTCTGTC TGAAGAACGA
AAAGCGGTCT TTGTGCCGGG CCTGGCGATT CTGTGCGGCG TTTTTGATGC TCTGGCTATC
CGCGAGCTTC GCCTTTCCGA CGGCGCGTTG CGCGAAGGCG TGCTGTATGA AATGGAAGGC
CGCTTCCGCC ATCAGGATGT TCGCAGCCGT ACCGCAAAAA GTCTGGCCAA TCAATACAAC
ATTGACAGAG AACAGGCCAG ACGCGTGCTG GAAACCACCA TGCAGATGTA CGAGCAGTGG
CAGGCCCAGC AGCCAAAACT GGCGCATCCG CAGCTTGAAG CGTTGCTCCG CTGGGCGGCA
ATGCTGCATG AGGTTGGACT GAATATTAAT CACAGCGGTT TACATCGCCA TTCGGCTTAT
ATTCTGCAAC ACAGCGATTT GCCCGGCTTT AATCAGGAGC AGCAAATGAT GATGGCGACG
CTGGTGCGTT ACCATCGTAA AGCCATAAAA CTGGATGATA TGCCCCGCTT TACGCTGTTT
AAGAAAAAAC AGTATCTGCC GTTAATTCAG CTACTTCGGC TGGGCGTATT ACTGAACAAC
CAGCGGCAGG CGACCACTAC GCCGCCAACG CTGCGACTAA CGACCGATGA CAGCCACTGG
ACGTTATGTT TTCCGCATGA CTGGTTCAGC CAGAATGCGC TGGTACTGCT TGATCTGGAA
AAAGAACAGC AGTACTGGGA AGCTGTAACT GGCTGGCGTC TCAATATTGA GGAAGAAAGC
TCGCCGGAGA TCGCCGCGTA A
 
Protein sequence
MTTSNHSSNQ TNAMPIYDKS PRPQEFAAVD LGSNSFHMVI ARVVDGAMQI IGRLKQRVHL 
ADGLGADNKL SEEAMERGLS CLSLFAERLQ GFSPSSVCIV GTHTLRQAQN AADFLKRAEK
VIPYPIEIIS GNEEARLIFM GVEHTQPEKG RKLVIDIGGG STELVIGENF EPRLVESRRM
GCVSFAQLYF PGGVINKENF QRARMAAAQK LETLTWQYRI QGWNVAMGAS GTIKAAHEVL
LALGEKDGFI TPERLDKLKS EVLKHRSFNA LSLPGLSEER KAVFVPGLAI LCGVFDALAI
RELRLSDGAL REGVLYEMEG RFRHQDVRSR TAKSLANQYN IDREQARRVL ETTMQMYEQW
QAQQPKLAHP QLEALLRWAA MLHEVGLNIN HSGLHRHSAY ILQHSDLPGF NQEQQMMMAT
LVRYHRKAIK LDDMPRFTLF KKKQYLPLIQ LLRLGVLLNN QRQATTTPPT LRLTTDDSHW
TLCFPHDWFS QNALVLLDLE KEQQYWEAVT GWRLNIEEES SPEIAA