Gene SeD_A2106 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A2106 
Symbol 
ID6872367 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp2026234 
End bp2027868 
Gene Length1635 bp 
Protein Length544 aa 
Translation table11 
GC content54% 
IMG OID642785216 
Productnickel ABC transporter periplasmic nickel-binding protein 
Protein accessionYP_002215881 
Protein GI198243726 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID[TIGR01376] Chlamydial polymorphic outer membrane protein repeat 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value0.000610577 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCTGCGCC GGGAGAAACA AAAACAACGC ATAATCTGGG ACAAATTGAT GATTAAAGGC 
AAACTGGCGC TCGTGACGTG CGCGCTGACT CTGGTATTCA CTTCGTCGCT TTTCGCCGCG
TCGGACACGG CAGACGGTCG TACCCTGAAG CTGGCCATTG GCCCTGAGCC AACGGAAGGC
TTTGATCCTA TGCTGGGCTG GAGCCACGGC AGCTACTTAT TGCTTCATGC GCCGTTGCTT
AAGCAAAACG CCGACATGAG TTGGGGTAAT TTACTGACGG AAAAAGTGGA TACCAGCCCC
GTCGGTAAAA TCTGGACGCT AACCTTAAAG CCCGGGCTGA AATTCTCCGA TGGCTCGCCA
TTAACTGCCG AAGATGTCGT TTTTACATAC AATAAAGCGG CGAAGAGCGG CGGTAAAATT
GACATGGGCA ATTTTAGCCA TGCGCGAGCG CTGGACGCAC GCCGAATTGA GATGACGCTG
AGCCATCCGC AGAGCACCTT TGTGAATGTA CTCGGGTCGT TAGGGATTGT TCCGGCCAGC
CGTTATGATG AAAAAACGTT CGCCCGCGAA CCGATAGGCG CCGGTCCGTA CCGACTGGTC
AGCTTTCAGC CGGGTCAGCA ACTGATCGTT GAAGCTAATC CCTGGTATGC GGGTAAAAAG
AATGACTTTA ACCGGCTGGT TTTTGTCTTT CTGGATGAAG ATAATGCTTA TGCCGCCGCG
CGCAGCGGAC AGTTGGGACT GGTGCGCATT GCCCCTTCTA TGGCGGTGGC CCCGCAGCAG
GATAATCTTA AACTCTGGGT ACGCGATAGC GTTGAAAACC GGGGCATTGT CTTCCCGATG
GTGCCTGCCG GTAAAAAGGA TGCTAACGGT TATCCTGTCG GCAACGATGT GACCGCTGAT
GTCGCTATCA GACGCGCAAT TAACTATGCC ATTAATCGTA AGCAACTGGC GGAACAGGTG
ATGGAAGGCC ATGCGATACC CGCCTATAGC GCGGTGCAGG GATTGCCGTG GCAAAATCCT
TCAGTGATAT TCAGCGATGG CGATATTGCA AAAGCGCGCG CCATCCTGGA AGAGGCTGGC
TGGAAGATAA ACCGTGCGGG CGTGCGTGAA AAAGCGGGTA AAGAAGCGCG TCTGACCTTA
TGGTATGCCA GTGGCGACAG CACCCGACGG GATCTGGCCG AGGCGGTGCG CGCCATGTTG
CAGCCTTTAG GCATTGTCGT CTCGTTGCAA TCGGGAAGCT GGGAAACTGT AGAGCGCCAT
ATGCACGCTA ACCCTACGCT GTTTGGCTGG GGAAGTCTGG ACCCGATGGA ACTCTTCCAT
CACTACAGTG GGAAAGCCGC TGGTGTGGAA TATTATAACC CGGGCTATTA CAGCAACCCT
GCGGTAGAAG CACATCTGAA ACAGGCTATA GATGCGCCTG ACTGGCAAAA GGCGATTCCT
TTCTGGCAGC AGGTTGAGTG GGATGGGAAG CAGGGCGCGG GCGTCCAGGG CGATGCGGCA
TGGGCATGGC TGCTTAATAT TCAGCATACC TATCTGGCCA ACCCCTGTAT TGATCTGGGA
AAAGGCGCGC CAGAAATCCA CGGTAGCTGG TCGGTGTTAA ACAATCTTGA TGACTGGACC
TGGACCTGTC GGTGA
 
Protein sequence
MLRREKQKQR IIWDKLMIKG KLALVTCALT LVFTSSLFAA SDTADGRTLK LAIGPEPTEG 
FDPMLGWSHG SYLLLHAPLL KQNADMSWGN LLTEKVDTSP VGKIWTLTLK PGLKFSDGSP
LTAEDVVFTY NKAAKSGGKI DMGNFSHARA LDARRIEMTL SHPQSTFVNV LGSLGIVPAS
RYDEKTFARE PIGAGPYRLV SFQPGQQLIV EANPWYAGKK NDFNRLVFVF LDEDNAYAAA
RSGQLGLVRI APSMAVAPQQ DNLKLWVRDS VENRGIVFPM VPAGKKDANG YPVGNDVTAD
VAIRRAINYA INRKQLAEQV MEGHAIPAYS AVQGLPWQNP SVIFSDGDIA KARAILEEAG
WKINRAGVRE KAGKEARLTL WYASGDSTRR DLAEAVRAML QPLGIVVSLQ SGSWETVERH
MHANPTLFGW GSLDPMELFH HYSGKAAGVE YYNPGYYSNP AVEAHLKQAI DAPDWQKAIP
FWQQVEWDGK QGAGVQGDAA WAWLLNIQHT YLANPCIDLG KGAPEIHGSW SVLNNLDDWT
WTCR