Gene SeD_A0036 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A0036 
Symbol 
ID6871789 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp37104 
End bp38675 
Gene Length1572 bp 
Protein Length523 aa 
Translation table11 
GC content47% 
IMG OID642783294 
Product5'-Nucleotidase domain-containing protein 
Protein accessionYP_002213988 
Protein GI198242289 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0737] 5'-nucleotidase/2',3'-cyclic phosphodiesterase and related esterases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones48 
Fosmid unclonability p-value0.0257777 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTTTATA TGAACAAAAA GTTTTCGATA TCCCTACTGT CGCTGTGCAT TGGTTTGTCT 
TCAGCCATTT CCTTTTCAGC CGATGCGCGT GACATCACAA TTTATTATAC AAACGATTTA
CATGCCCATG TAACCCCAGA AATTATCCCC TATGTATCCA AGACACGTCC GGTAGGCGGC
TTTGCGCCCA TCTCGAAAAT TGTCAAAGAT GCAAAAGCGA AAGAGAAAGA TGTCTTTTTC
TTTGATGCTG GCGACTATTT CACCGGACCT TTTATCAGTA CGCTGACCAA AGGCGAGGCT
ATTATTGATA TTTTAAATAC CATGCCTTAC GACGCCGTCT CTGTCGGTAA CCATGAATTT
GACCATGGCC ATGAGAATCT GGTTAAACAA CTCAGCAAAT TGCAATTCCC GGTATTGTTG
GATAATGTTT TTTACAGCGG CACAGATACG CCATTAATTA AAGAACCGTA TACCATCGTG
GAAAAAGATG GATTCAAGAT CGGCGTCATC GGTATGCACG GCGTTTCCGC ATTCTATGAA
GCGATTGCCG CAGGCGTGCG TGAAGGCGTT GACTGCCGCG ATCCGATTCC TTATGTGAAA
AAACAGCTGG AAGAGTTAAA AGGGAAAGTT GACCTGACCG TGCTGCTCGC CCACGAAGGC
GTGCCGGGTA TGCAGTCCAG CGCAGGCGAG GCTGATGTCG CACGCGCGCT GAAAACCGAC
GTTGATATGG CGAAATCGCT GGAAGGCTAT GGACTTAACG TCCTGATTAC CGGCCATGCG
CATAAAGGTA CGCCAGAACC GATTAAAGTG GGCGATACCC TTGTCGTTTC CACGGATGCG
TACACCATCG AATTAGGTAA ACTGGTGCTT GACTGGAACC CGGAAACCAA AAAAGTGGAC
AGCTACAATG GTAAGTTGAT CACCATGTAT GCGGATACTT ATAAGCCAGA TCCGGTCACG
CAGGCCAAAA TTGACGAATG GGATAACAAG GTTAAGAAAA TTACCGATGA GGTGGTCGCG
CACTCTCCGG AAGTGCTGAC CCGTTCTTAC GGTGAATCCG CGCCAACCGG CAACTTAATC
ACCGATGCCC TGATGGCTAC CGTTCCTGGC GCCGACGCTT CCTTCTATAA TGCTGGCGGC
ATCCGTACCG AATTGCCAAA AGGCAATATC ACCTATGGTG ATGTGCTGAG TATGTATCCG
TTCACCAACG ATGTCATGAG CATGGAAATC AGCGGTAAAG ACCTGAAATC CATCATGTCA
CACGCTGCCG ATCTGAAAAA CGGTATGCTG CACGTATCTA AAACCGTCCA GTTTAAATAT
GACAGCACCA AACCGCTGGG CCAGCGTATT GTTGGATTTG ATATCAAAGG CAAACCGGTA
GAAGACAATA AACTCTATAC CGTCGCGCTG GACTCCTTTA TCGGTAAAGG TGGTGGCGGA
TTTACCTTCA CTAAAGGTAA AAATATCAAA TATATAGGGA TACAAACCGC ACCGGCGTTG
GTTAACTATA TGAAGCAGGT TAACAATATT CAACCTGACC ACACCATGCG CGTGGATGAT
ATTAGCAAAT AA
 
Protein sequence
MVYMNKKFSI SLLSLCIGLS SAISFSADAR DITIYYTNDL HAHVTPEIIP YVSKTRPVGG 
FAPISKIVKD AKAKEKDVFF FDAGDYFTGP FISTLTKGEA IIDILNTMPY DAVSVGNHEF
DHGHENLVKQ LSKLQFPVLL DNVFYSGTDT PLIKEPYTIV EKDGFKIGVI GMHGVSAFYE
AIAAGVREGV DCRDPIPYVK KQLEELKGKV DLTVLLAHEG VPGMQSSAGE ADVARALKTD
VDMAKSLEGY GLNVLITGHA HKGTPEPIKV GDTLVVSTDA YTIELGKLVL DWNPETKKVD
SYNGKLITMY ADTYKPDPVT QAKIDEWDNK VKKITDEVVA HSPEVLTRSY GESAPTGNLI
TDALMATVPG ADASFYNAGG IRTELPKGNI TYGDVLSMYP FTNDVMSMEI SGKDLKSIMS
HAADLKNGML HVSKTVQFKY DSTKPLGQRI VGFDIKGKPV EDNKLYTVAL DSFIGKGGGG
FTFTKGKNIK YIGIQTAPAL VNYMKQVNNI QPDHTMRVDD ISK