Gene SeD_A1653 details


Gene Information

Locus tag: SeD_A1653
Symbol:
ID: 6874976
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853
Kingdom: Bacteria
Replicon accession: NC_011205
Strand:
Start bp: 1594995
End bp: 1596608
Gene Length: 1614 bp
Protein Length: 537 aa
Translation table: 11
GC content: 49%
IMG OID: 642784797
Product: periplasmic murein peptide-binding protein
Protein accession: YP_002215465
Protein GI: 198243089
COG category: [E] Amino acid transport and metabolism
COG ID: [COG4166] ABC-type oligopeptide transport system, periplasmic component
TIGRFAM ID:
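
The start, end, gene length, and protein length fields in this record are arithmetically related, and a short check can confirm they agree. The sketch below assumes 1-based inclusive coordinates and a protein length that excludes the terminal stop codon; both assumptions are inferred from the numbers, not stated in the record, and the function names are illustrative.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon is not counted."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length = gene_length_bp(1594995, 1596608)
print(length, protein_length_aa(length))  # 1614 537
```

Under these assumptions the computed values match the Gene Length and Protein Length fields.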


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 51
Fosmid unclonability p-value: 0.0552012
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGGCATT CTGTTTCAGT AACCTGTTGT GCGCTGTTGG TTAGCAGCTT TTCCCTGGCA 
TATGCTGCGG ATGTTCCCGG CGGAACGGTA CTGGCAGAAA AACAAGAACT GGTGCGTCAT
ATCAAAGATG AGCCCGCTTC GTTAGATCCA GCGAAAGCCG TTGGGCTGCC TGAAATTCAG
GTGATCCGCG ATCTGTTTGA AGGGCTGGTT AACCAGAACG AAAAGGGTGA AATTATCCCT
GGCGTCGCCA GCCAGTGGAA GAGCAATGAT AATCGTATCT GGACGTTTAC GTTGCGGGAT
AATGCGCAAT GGGCCGATGG AACGCCGGTG ACCGCCCAGG ATTTTGTCTA TAGCTGGCAA
CGTCTCGTTG ATCCCAAAAC GCTTTCCCCC TTCGCCTGGT TTGCCGCGCT GGCTGGCATC
ACTAATGCGC AAGCCATTAT CGACGGTAAA GTCACGCCGG ATCAGCTTGG CGTCAGTGCC
GTGGACGCGC ACACTTTGCG TGTTCAGCTT GACAAGCCGT TGCCCTGGTT TGCCAGTCTG
ACCGCCAGTT TTGCCTTTTA TCCGGTCCAA AAAGCGAATG TCGAAAGCGG CAAAGACTGG
ATGAAGCCGG GAAAACTGAT TGGCAATGGC GCGTATGTGC TTAAAGAGCG CGTGGTAAAT
GAAAAACTGG TGGTCGTGCC TAATACGCAT TACTGGGATA ACGCGAAAAC GGTACTGCAA
AAAGTAACAT TTTTACCCAT TAACCAAGAA TCGGCTGCGA CGAAACGTTA CCTTGCCGGT
GATATTGATA TCACCGAATC TTTCCCTAAA AATATGTACC AGAAATTATT GAAGGATATT
CCAGGGCAAG TTTATACGCC GCCGCAATTA GGGACTTATT ATTATGCGTT TAATACGCAG
AAAGGGCCGA CGGCGGATTC CCGCGTTCGT CTGGCGCTAA GTATGACCAT TGATCGCCGT
TTGATGGCGG AAAAAGTCTT AGGTACCGGT GAAAAACCGG CCTGGCATTT TACACCGGAT
GTCACGGCAG GATTTAAGCC CGATCCTTCA CCGTTTGAAC AAATGAGCCA GGAAGAACTT
AACGCCCAGG CGAAAACATT GCTGCGTGCA GCAGGCTACG GATCGCAGAA GCCGCTTAAA
TTAACTCTGC TTTACAATAC CTCAGAAAAC CATCAGAAAA TCGCGATTGC GGTGGCGTCA
ATGTGGAAGA AAAATCTGGG GGTGGATGTG AAATTGCAAA ACCAGGAGTG GAAAACGTAT
ATCGACAGCC GGAATACAGG TAATTTTGAT GTTATTCGCG CCTCCTGGGT GGGTGATTAC
AACGAACCGT CGACTTTCTT ATCCTTATTA ACGTCCACGC ATACGGGGAA TATTTCACGC
TTTACTAATC CGACTTATGA CAAAATCCTG ACGCAAGCGA CGATGGAAAA TACCGCCGAA
GCGCGTAACG CGGATTACAA TGCAGCGGAG AAAATTTTAA CGGAACAAGC GCCTATAGCG
CCTATTTATC AGTATACCAA TGGCCGGTTA ATTAAACCGT GGGTAAAGGG ATACCCCATT
ACTAACCCGG AAGATGTGGC CTATAGCCGT ACAATGTATA TCGTGAAGCA CTGA
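
The gene sequence is displayed in space-separated blocks of ten bases, so it has to be cleaned before any computation such as GC content. The following is a minimal sketch of that cleanup and of a GC-fraction calculation; the helper names are hypothetical, and the way the database itself derives its "GC content" field is not stated in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block separators, line breaks) and uppercase."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of G and C bases in a sequence, ignoring layout whitespace."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First displayed line of the gene sequence, copied from the record.
first_line = "ATGAGGCATT CTGTTTCAGT AACCTGTTGT GCGCTGTTGG TTAGCAGCTT TTCCCTGGCA"
print(len(clean_sequence(first_line)))  # 60
print(round(gc_fraction(first_line), 3))
```

Passing the full sequence text to `gc_fraction` and multiplying by 100 gives a percentage that can be compared with the GC content field.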
 
Protein sequence
MRHSVSVTCC ALLVSSFSLA YAADVPGGTV LAEKQELVRH IKDEPASLDP AKAVGLPEIQ 
VIRDLFEGLV NQNEKGEIIP GVASQWKSND NRIWTFTLRD NAQWADGTPV TAQDFVYSWQ
RLVDPKTLSP FAWFAALAGI TNAQAIIDGK VTPDQLGVSA VDAHTLRVQL DKPLPWFASL
TASFAFYPVQ KANVESGKDW MKPGKLIGNG AYVLKERVVN EKLVVVPNTH YWDNAKTVLQ
KVTFLPINQE SAATKRYLAG DIDITESFPK NMYQKLLKDI PGQVYTPPQL GTYYYAFNTQ
KGPTADSRVR LALSMTIDRR LMAEKVLGTG EKPAWHFTPD VTAGFKPDPS PFEQMSQEEL
NAQAKTLLRA AGYGSQKPLK LTLLYNTSEN HQKIAIAVAS MWKKNLGVDV KLQNQEWKTY
IDSRNTGNFD VIRASWVGDY NEPSTFLSLL TSTHTGNISR FTNPTYDKIL TQATMENTAE
ARNADYNAAE KILTEQAPIA PIYQYTNGRL IKPWVKGYPI TNPEDVAYSR TMYIVKH
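
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below builds the standard codon table, which translation table 11 shares for elongation codons; table 11 differs in which codons may act as start codons, and that is not handled here (this gene begins with ATG, so it does not matter for this record). The `translate` function is an illustrative helper, not the database's own method, and it raises `KeyError` on ambiguous bases such as N.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in TCAG codon order (TTT, TTC, TTA, TTG, TCT, ...); '*' marks stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence in this record.
print(translate("ATGAGGCATT CTGTTTCAGT AACCTGTTGT"))  # MRHSVSVTCC
print(translate("GTGAAGCACTGA"))                      # VKH
```

The two fragments reproduce the first ten and last three residues of the listed protein sequence.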