Gene SeHA_C1865 details


Gene Information

Locus tag: SeHA_C1865
Symbol: mppA
ID: 6492264
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Heidelberg str. SL476
Kingdom: Bacteria
Replicon accession: NC_011083
Strand:
Start bp: 1819810
End bp: 1821423
Gene Length: 1614 bp
Protein Length: 537 aa
Translation table: 11
GC content: 49%
IMG OID: 642742078
Product: periplasmic murein peptide-binding protein
Protein accession: YP_002045723
Protein GI: 194448774
COG category: [E] Amino acid transport and metabolism
COG ID: [COG4166] ABC-type oligopeptide transport system, periplasmic component
TIGRFAM ID:
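
The coordinate and length fields above are consistent with one another. The following is a minimal sketch of that check, assuming the start and end positions are 1-based and inclusive and that the unspliced CDS ends in a single stop codon. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 1819810
END_BP = 1821423


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints "1614 537"
```

Under these assumptions the result matches the listed Gene Length (1614 bp) and Protein Length (537 aa).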


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.529105
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 73
Fosmid unclonability p-value: 0.987558
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGGCATT CTGTTTCAGT AACCTGTTGT GCGCTGTTGG TTAGCAGCTT TTCCCTGGCA 
TATGCTGCGG ATGTTCCCGG CGGAACGGTA CTGGCAGAGA AACAAGAACT GGTGCGTCAT
ATCAAAGATG AGCCCGCTTC GTTAGATCCA GCGAAAGCCG TTGGGCTGCC TGAAATTCAG
GTGATCCGCG ATCTGTTTGA AGGGCTGGTT AACCAGAACG AAAAGGGTGA AATTATCCCT
GGCGTCGCCA GCCAGTGGAA GAGCAATGAT AATCGTATCT GGACGTTTAC GTTGCGGGAT
AATGCGCAAT GGGCCGATGG AACGCCGGTG ACCGCCCAGG ATTTTGTCTA TAGCTGGCAA
CGCCTCGTTG ATCCCAAAAC GCTTTCCCCC TTCGCCTGGT TTGCCGCGCT GGCTGGCATC
ACTAATGCGC AAGCCATTAT CGACGGTAAA GTCACGCCGG ATCAGCTTGG CGTTAGTGCC
GTGGACGCGC ACACTTTGCG TGTTCAGCTT GACAAGCCGT TGCCCTGGTT TGCCAGTCTG
ACCGCCAGTT TTGCCTTTTA TCCGGTCCAA AAAGCGAATG TTGAAAGCGG CAAAGACTGG
ATGAAGCCGG GAAAACTGAT TGGCAATGGC GCGTATGTGC TTAAAGAGCG CGTGGTAAAT
GAAAAACTGG TGGTCGTGCC TAATACGCAT TACTGGGATA ACGCGAAAAC GGTACTGCAA
AAAGTGACAT TTTTACCCAT TAACCAGGAA TCGGCTGCGA CGAAACGTTA TCTTGCCGGT
GATATTGATA TCACCGAATC TTTCCCTAAA AATATGTACC AGAAATTATT AAAGGATATT
CCAGGGCAAG TTTATACGCC GCCGCAATTA GGGACTTATT ATTATGCGTT TAACACGCAG
AAAGGGCCGA CGGCGGATTC CCGCGTTCGT CTGGCGCTAA GTATGACCAT TGATCGCCGT
TTGATGGCGG AAAAAGTCTT AGGTACCGGT GAAAAACCGG CCTGGCATTT TACACCGGAT
GTCACGGCAG GATTTAAGCC CGATCCTTCA CCGTTTGAAC AAATGAGCCA GGAAGAACTT
AACGCCCAGG CGAAAACATT GCTGCGTGCA GCAGGCTACG GATCGCAGAA GCCGCTTAAA
TTAACTCTGC TTTACAATAC CTCAGAAAAC CATCAGAAAA TCGCGATTGC GGTGGCGTCA
ATGTGGAAGA AAAATCTGGG GGTGGATGTG AAATTGCAAA ACCAGGAGTG GAAAACGTAT
ATCGACAGCC GGAATACCGG TAATTTTGAT GTTATTCGCG CCTCCTGGGT GGGTGATTAC
AACGAACCGT CGACTTTCTT ATCCTTATTA ACGTCCACGC ATACGGGGAA TATTTCACGC
TTTACTAATC CGACTTATGA CAAAATCCTG ACGCAAGCGA CGATGGAAAA TACCGCCGAA
GCGCGTAACG CGGATTACAA TGCAGCGGAG AAAATTTTAA CGGAACAAGC GCCTATAGCG
CCTATTTATC AGTATACCAA TGGCCGGTTA ATTAAACCGT GGGTAAAGGG ATACCCCATT
ACTAACCCGG AAGATGTGGC CTATAGCCGT ACAATGTATA TCGTGAAGCA CTGA
 
Protein sequence
MRHSVSVTCC ALLVSSFSLA YAADVPGGTV LAEKQELVRH IKDEPASLDP AKAVGLPEIQ 
VIRDLFEGLV NQNEKGEIIP GVASQWKSND NRIWTFTLRD NAQWADGTPV TAQDFVYSWQ
RLVDPKTLSP FAWFAALAGI TNAQAIIDGK VTPDQLGVSA VDAHTLRVQL DKPLPWFASL
TASFAFYPVQ KANVESGKDW MKPGKLIGNG AYVLKERVVN EKLVVVPNTH YWDNAKTVLQ
KVTFLPINQE SAATKRYLAG DIDITESFPK NMYQKLLKDI PGQVYTPPQL GTYYYAFNTQ
KGPTADSRVR LALSMTIDRR LMAEKVLGTG EKPAWHFTPD VTAGFKPDPS PFEQMSQEEL
NAQAKTLLRA AGYGSQKPLK LTLLYNTSEN HQKIAIAVAS MWKKNLGVDV KLQNQEWKTY
IDSRNTGNFD VIRASWVGDY NEPSTFLSLL TSTHTGNISR FTNPTYDKIL TQATMENTAE
ARNADYNAAE KILTEQAPIA PIYQYTNGRL IKPWVKGYPI TNPEDVAYSR TMYIVKH
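
The gene and protein sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that layout, translate the nucleotide sequence, and compute GC content. It assumes that translation table 11 assigns the same amino acid to each codon as the standard genetic code (the two differ in permitted start codons), so it does not handle alternative start codons. All helper names are hypothetical.

```python
from itertools import product

# Codon table in the usual T, C, A, G ordering (assumed valid for table 11).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(block_text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(block_text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a cleaned DNA string."""
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence listing above.
first_line = "ATGAGGCATT CTGTTTCAGT AACCTGTTGT GCGCTGTTGG TTAGCAGCTT TTCCCTGGCA"
print(translate(clean(first_line)))  # prints "MRHSVSVTCCALLVSSFSLA"
```

The translated first line agrees with the first twenty residues of the protein sequence listed above. Passing the full gene sequence through `clean` and `gc_percent` gives a value to compare against the GC content field.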