Gene SeHA_C2933 details


Gene Information

Locus tag: SeHA_C2933
Symbol:
ID: 6491505
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Heidelberg str. SL476
Kingdom: Bacteria
Replicon accession: NC_011083
Strand:
Start bp: 2872149
End bp: 2873828
Gene Length: 1680 bp
Protein Length: 559 aa
Translation table: 11
GC content: 57%
IMG OID: 642743093
Product: putative dipeptide/oligopeptide/nickel ABC-type transport system periplasmic component
Protein accession: YP_002046717
Protein GI: 194450396
COG category: [R] General function prediction only
COG ID: [COG4533] ABC-type uncharacterized transport system, periplasmic component
TIGRFAM ID:
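
The Start bp, End bp, Gene Length, and Protein Length fields above can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes the coordinates are 1-based and inclusive and that the last codon is an untranslated stop codon. The function name `cds_lengths` is hypothetical and not part of the record.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for a CDS.

    Assumptions: coordinates are 1-based and inclusive, the CDS is
    unspliced, and the final codon is a stop codon that is not translated.
    """
    gene_length = end_bp - start_bp + 1
    codons, remainder = divmod(gene_length, 3)
    if remainder != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length, codons - 1


# Values copied from the record above.
print(cds_lengths(2872149, 2873828))  # (1680, 559)
```

Under these assumptions the result agrees with the listed gene length of 1680 bp and protein length of 559 aa.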


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value: 0.0000000000113765
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAGTATCT TCGAGGCGCA TTTTCGCAGG CTGCACGCCC GCTACGGCGC AGGTCAGACG 
CACGAATTAC AGATGCAGGA GATCGCCGCC ATCTTCGGCT GTTCGGTGCG TAATTGTCGT
ATTGCGCTAA AAAAGATGCA TCAGGAAAAA TGGCTCGACT GGCAGCCCCA GCGCGGGCGC
GGCAAGCGCT CACGGCTCCA TCTGTTAACC TCGCCGGAAA AGCTGTTCAG CCAGAACGTC
AATAAGCTGC TGGAGAAGCA GGATTACGGC AACGTGCTGC GGTTTATCGG CAACGACAAG
TATCTGCTGG ATCGCCTGAG CCTGTGGCGC TTTGGGGTAC AGGATAAAAG CAGCGAAACG
CGGGTACGCA TCCCCTACTA TCGCAATCTG GATCCGCTTA ACCCACTCGT CCCCCTGCGG
CGGACCGAAC GCCACCTTCT GCGCCAGTGC CTGAGCGGAC TGACGCGCTA TGACGCCGTT
CAGGGCAGGA TCGTCCCCGA TATCGCCCAC TACTGGACCC ATAACGAGGA CTTTACCCGC
TGGGAGTTCT GGCTAAAATC CACCGCCCGC TTTGCCGATG GCTGTGAGCT GGATGCCAGC
GCCGTTCAGC GCTGCCTGCT CGCCGCCAGC CAGAGCCCGC AGTTCGCGCC GATTTTCAGC
CCAATCAAAA CCATTACCGC TGACGCCCCC TGGCATCTGG TAATTGAAAC GCATCATCCG
GTCAGACGGC TCGACTGCCT GCTCGCCACC CAGCCGACGA TGCTGTTTGA TTACCAGCAC
GGACACATCC GCTGTACCGG CGCTTTCCAC CTGCAGGAGC ACAGCGACAA TTTTATGGTT
TTGCGGCGCA ATCAACACTG GCATCAGGCG CGCCCCGGAC TGGATGAGAT CACCATTTTC
ACCTGGGCTC CGGAGCATAT CAGCATGAGC TTTATTCCCC TGCTGCGGGG CGAAGAGGTG
CAGGATGACC GCCCGCTCAA CGAGCGTAGT CTGGAGCAGA GCTGCTGCTT TGTGCTGCTC
GACGGTGACG GTGCCTTTGC AGATGAGGCC GGAAGACGGT TTATCAATTA CCTGCTGCAA
CCTGTCGAAC TCCTCAGCCA GACGCAGCTT CCCGACGAAT ACGCACGCAT CCTCTCTGTG
GCTCAGGGTA TGCTGCCGCA GTGGAATCAC CGCCCGGTAG ATTTTGGCGG GATCACAGCG
CCGTTTAACC TGCGTCAGCC GGTTATTATC AGCACCTTTC AGCAACCGGA ACTGGTGGAG
CTTGCCGGAG CAATTCGCCG CCTGCTTGAA CGCTGGCATA TTCGCGCCGA AATACGGATC
GACGCATTTG ACCACTTTAA CAACCAGCTG CGCCCTCCCG CGGATATCTG GCTCAGCAAC
TTTATGCTCG ATACCCTTTC GGTTCCGGCA TTTCTGGAAT GGCTAGCTTC CACCGCGCTG
TTTACACGAC TGCCTGAATC CCAGCGACAA AACCTGAACG CGCTGTTACC GACGATTCTA
AACAGCGACG ATGAACAGGC GTTCGCTACC ATTGCCGCCT TTTTCCATGA GATGACCCAC
CAGCGATATG TCATTCCTTT GCTGCATCAC TGGATGGAAT TTGCGACTGA GAAGTCATTT
ACCTGGCGGG ATTTAAATAC GCTGGGATGG CCGGATTTCA GCCAACTTTG GCTCGAATAA
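
A GC content figure such as the one listed for this gene can be recomputed from the sequence block above. The following is a minimal sketch, assuming the sequence is pasted as plain text in blocks of ten bases separated by whitespace; the helper name `gc_percent` is hypothetical.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence.

    Whitespace (the spaces and line breaks used in the listing) is ignored.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = bases.count("G") + bases.count("C")
    return 100.0 * gc / len(bases)


# First row of the gene sequence above; paste the full listing to check
# the whole gene.
first_row = "ATGAGTATCT TCGAGGCGCA TTTTCGCAGG CTGCACGCCC GCTACGGCGC AGGTCAGACG"
print(round(gc_percent(first_row), 1))
```

Rounding the value for the full 1680 bp sequence to a whole percent gives a number that can be compared with the GC content field.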
 
Protein sequence
MSIFEAHFRR LHARYGAGQT HELQMQEIAA IFGCSVRNCR IALKKMHQEK WLDWQPQRGR 
GKRSRLHLLT SPEKLFSQNV NKLLEKQDYG NVLRFIGNDK YLLDRLSLWR FGVQDKSSET
RVRIPYYRNL DPLNPLVPLR RTERHLLRQC LSGLTRYDAV QGRIVPDIAH YWTHNEDFTR
WEFWLKSTAR FADGCELDAS AVQRCLLAAS QSPQFAPIFS PIKTITADAP WHLVIETHHP
VRRLDCLLAT QPTMLFDYQH GHIRCTGAFH LQEHSDNFMV LRRNQHWHQA RPGLDEITIF
TWAPEHISMS FIPLLRGEEV QDDRPLNERS LEQSCCFVLL DGDGAFADEA GRRFINYLLQ
PVELLSQTQL PDEYARILSV AQGMLPQWNH RPVDFGGITA PFNLRQPVII STFQQPELVE
LAGAIRRLLE RWHIRAEIRI DAFDHFNNQL RPPADIWLSN FMLDTLSVPA FLEWLASTAL
FTRLPESQRQ NLNALLPTIL NSDDEQAFAT IAAFFHEMTH QRYVIPLLHH WMEFATEKSF
TWRDLNTLGW PDFSQLWLE
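
The protein sequence above can be reproduced from the gene sequence by codon-by-codon translation. The sketch below is an illustration, not the database's own pipeline: it builds the standard codon-to-amino-acid assignments (NCBI translation table 11 uses the same assignments for internal codons and differs mainly in its permitted start codons) and assumes the listed sequence is already the coding strand in frame. The names `CODON_TABLE` and `translate` are hypothetical.

```python
# Codon table in NCBI order (bases T, C, A, G); "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def translate(sequence: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace is ignored. Codons containing characters other than
    A, C, G, T raise KeyError; trailing bases that do not fill a codon
    are dropped.
    """
    dna = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGAGTATCT TCGAGGCGCA TTTTCGCAGG"))  # MSIFEAHFRR
```

The printed residues match the first ten residues of the listed protein sequence; passing the full gene sequence allows the remaining residues to be compared in the same way.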