Gene SeD_A2344 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A2344 
Symbolamn 
ID6872115 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp2221732 
End bp2223186 
Gene Length1455 bp 
Protein Length484 aa 
Translation table11 
GC content54% 
IMG OID642785437 
ProductAMP nucleosidase 
Protein accessionYP_002216097 
Protein GI198244603 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0775] Nucleoside phosphorylase 
TIGRFAM ID[TIGR01717] AMP nucleosidase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0273914 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value0.00120533 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGAAAATA AAGGCACAAA CCTAACGCCC GAGCAGGCGC TGGATCGCCT GGAGGAGCTA 
TACGAGCAGT CGGTTAACGC GCTCCGTGAA GCCATCGCTG ACTATGTTGA TAACGGTACG
CTGCCCGATC CCCACGCCAG GCTTAACGGA CTGTTTGTTT ATCCTTCGCT GTCTGTAAGC
TGGGATGGCG CGACACCGAA CCCGCCTAAA ACACGCGCTT TTGGACGCTT TACTCATCCC
GGCTGTTATA CCACTACCGT TACGCGGCCT GCGCTATTTC GCGCTTATCT TCTGGAACAG
CTTAACCTCG TTTACCATGA TTATGGCGCG CATATTGCGG TAGAGGCCTC GCACCATGAG
ATCCCGTATC CTTATGTGAT CGATGGCTCG GCTCTGACGC TGGATCGTTC TATGAGCGCC
GGGCTAACGC GCCATTTTCC TACCACAGAA CTGGCGCAGA TTGGCGATGA GACGGCAGAC
GGTCTGTTCC ACCCCGGCGA ATTCTATCCG CTATCGCACT TCGACGCGCG TCGTGTTGAC
TTCTCGTTGG CCCGGTTACG GCACTATACC GGTACGCCGG TTGAACATTT TCAGCCCTTC
GTTTTGTTTA CCAACTACAC CCGCTATGTC GATGAGTTTG TGCGCTGGGG ATGCAGCCAA
ATTCTTGATC CTGACAGTCC CTATATCGCG CTTTCCTGTG CCGGCGGAAT TTGGATCACG
GCAGAAACGG AAGCGCCGGA AGAGGCTATT TCCGATCTGG CCTGGAAGAA GCATCAGATG
CCCGCCTGGC ACCTGGTAAC GGCAGATGGG CAGGGGATTA CGCTGGTAAA TATCGGTGTC
GGCCCTTCAA ATGCTAAAAC GATTTGCGAC CATTTGGCCG TGTTGCGGCC TGACGTCTGG
CTTATGATTG GGCACTGTGG AGGGTTACGA GAAAGTCAGG CGATTGGTGA TTATGTGCTG
GCCCATGCCT ACTTACGCGA TGATCATGTG CTGGATGCCG TTCTGCCGCC TGATATTCCT
ATTCCAAGCA TTGCGGAAGT ACAGCGTGCG CTGTACGACG CTGCCAAAGC GGTTAGCGGA
ATGCCCGGTG AAGAGGTGAA ACAGCGACTG CGCACGGGAA CGGTCGTCAC CACGGATGAT
CGTAACTGGG AGCTTCGTTA TTCCGCATCG GCGCTGCGAT TCAACCTTAG CCGCGCGGTC
GCGATTGATA TGGAGAGCGC GACGATTGCA GCCCAGGGCT ATCGCTTCCG CGTGCCTTAC
GGCACGCTGC TTTGTGTTTC TGACAAACCT TTACACGGTG AAATTAAACT GCCCGGGCAG
GCAAACCGGT TTTATGAGGG GGCTATTTCA GAACATTTGC AAATTGGTAT TCGTGCTATT
GATTTACTAC GCGCAGAAGG TGATCGTTTG CACTCACGTA AATTACGCAC GTTTAATGAG
CCACCGTTTC GCTAA
 
Protein sequence
MENKGTNLTP EQALDRLEEL YEQSVNALRE AIADYVDNGT LPDPHARLNG LFVYPSLSVS 
WDGATPNPPK TRAFGRFTHP GCYTTTVTRP ALFRAYLLEQ LNLVYHDYGA HIAVEASHHE
IPYPYVIDGS ALTLDRSMSA GLTRHFPTTE LAQIGDETAD GLFHPGEFYP LSHFDARRVD
FSLARLRHYT GTPVEHFQPF VLFTNYTRYV DEFVRWGCSQ ILDPDSPYIA LSCAGGIWIT
AETEAPEEAI SDLAWKKHQM PAWHLVTADG QGITLVNIGV GPSNAKTICD HLAVLRPDVW
LMIGHCGGLR ESQAIGDYVL AHAYLRDDHV LDAVLPPDIP IPSIAEVQRA LYDAAKAVSG
MPGEEVKQRL RTGTVVTTDD RNWELRYSAS ALRFNLSRAV AIDMESATIA AQGYRFRVPY
GTLLCVSDKP LHGEIKLPGQ ANRFYEGAIS EHLQIGIRAI DLLRAEGDRL HSRKLRTFNE
PPFR