Gene SeHA_C1878 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeHA_C1878 
Symbol 
ID6489253 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Heidelberg str. SL476 
KingdomBacteria 
Replicon accessionNC_011083 
Strand
Start bp1831467 
End bp1833140 
Gene Length1674 bp 
Protein Length557 aa 
Translation table11 
GC content53% 
IMG OID642742091 
Productpeptide transport periplasmic protein SapA 
Protein accessionYP_002045736 
Protein GI194448736 
COG category[E] Amino acid transport and metabolism 
COG ID[COG4166] ABC-type oligopeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones52 
Fosmid unclonability p-value0.00804887 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
TTGAACTTCA AAAACTTAAC TATTATGCGC CTGGTTTTAT CGTCTCTGAT CGTGATAGCG 
GGTCTACTGA GTAGTCAGGC TACGGCTGCG ACTACACCCG AACAAACTGC GAGTGCAGAT
ATTCGCGATA GCGGCTTTGT GTATTGTGTC AGCGGGCAGG TCAACACCTT TAATCCGCAA
AAAGCGAGTA GCGGCCTCAT CGTCGATACC CTGGCCGCCC AGTTATATGA CCGTCTGTTG
GATGTCGATC CCTATACTTA TCGTTTAGTC CCAGAGCTGG CAGAAAGCTG GGAAGTGCTG
GATAACGGGG CAACGTACCG TTTTCACCTG CGCCGCGACG TTTCCTTTCA AAAAACCGCC
TGGTTTACGC CGACCCGAAA ACTCAATGCT GATGATGTCG TCTTTACCTT TCAGCGGATT
TTCGATCGTC GGCATCCGTG GCATAACATC AACGGCAGTA GCTTCCCCTA CTTTGATAGC
CTACAGTTCG CCGACAATGT AAAAAGCGTG CGTAAGCTGG ACAATAACAC CGTTGAGTTT
CGCCTGACGC AGCCAGACGC CTCCTTTTTA TGGCATCTGG CCACACACTA CGCTTCCGTC
ATGTCCGCTG AGTACGCCGC GCAGCTTAGC CGAAAAGATC GTCAGGAACT GCTAGACCGC
CAACCGGTCG GCACCGGGCC TTTCCAGCTT TCGGAGTACC GTGCCGGACA GTTTATTCGT
CTCCAGCGCC ACGATGGGTT TTGGCGCGGC AAACCGCTGA TGCCGCAAGT GGTAGTGGAT
TTAGGCTCCG GCGGTACCGG GCGTTTATCG AAATTACTGA CCGGTGAATG CGATGTTCTG
GCCTGGCCCG CCGCCAGCCA GCTAACTATT TTACGCGACG ATCCGCGTTT GCGCCTGACG
TTGCGCCCGG GGATGAATAT CGCCTATCTG GCCTTTAACA CCGATAAGCC GCCGTTGAAT
AATCCCGCAG TGCGCCATGC GCTGGCCTTA TCGATCAACA ACCAGCGTCT GATGCAGTCG
ATTTATTACG GCACGGCGGA AACCGCAGCC TCCATTTTAC CGAGAGCCTC ATGGGCTTAC
GATAACGATG CCAAAATTAC GGAGTACAAT CCGCAAAAAT CGCGCGAACA GCTAAAAGCG
CTGGGCATTG AGAATCTTAC GCTGCATCTC TGGGTGCCGA CCAGTTCTCA GGCCTGGAAC
CCAAGTCCGC TAAAAACGGC GGAGCTTATT CAGGCGGATA TGGCGCAGGT TGGCGTAAAA
GTGGTCATTG TGCCGGTTGA AGGTCGTTTT CAGGAGGCGC GCCTGATGGA TATGAATCAC
GATCTGACCT TATCCGGCTG GGCCACGGAC AGCAACGATC CGGATAGCTT TTTCAGACCA
CTGTTAAGCT GTGCGGCCAT CAATTCGCAA ACCAATTTCG CCCACTGGTG TAACCCTGAA
TTTGACAGCG TGCTGCGTAA AGCACTGTCG TCGCAGCAGT TGGCTTCGCG CATAGAAGCA
TATGAGGAAG CGCAGAATAT CCTGGAGAAA GAGCTGCCGA TACTGCCGCT GGCATCATCA
CTACGCCTGC AGGCTTACCG CTACGATATT AAAGGGCTGG TGTTAAGCCC GTTCGGCAAT
GCGTCTTTTG CCGGCGTCTC CCGCGAAAAA CACGAAGAGG TGAAAAAACC ATGA
 
Protein sequence
MNFKNLTIMR LVLSSLIVIA GLLSSQATAA TTPEQTASAD IRDSGFVYCV SGQVNTFNPQ 
KASSGLIVDT LAAQLYDRLL DVDPYTYRLV PELAESWEVL DNGATYRFHL RRDVSFQKTA
WFTPTRKLNA DDVVFTFQRI FDRRHPWHNI NGSSFPYFDS LQFADNVKSV RKLDNNTVEF
RLTQPDASFL WHLATHYASV MSAEYAAQLS RKDRQELLDR QPVGTGPFQL SEYRAGQFIR
LQRHDGFWRG KPLMPQVVVD LGSGGTGRLS KLLTGECDVL AWPAASQLTI LRDDPRLRLT
LRPGMNIAYL AFNTDKPPLN NPAVRHALAL SINNQRLMQS IYYGTAETAA SILPRASWAY
DNDAKITEYN PQKSREQLKA LGIENLTLHL WVPTSSQAWN PSPLKTAELI QADMAQVGVK
VVIVPVEGRF QEARLMDMNH DLTLSGWATD SNDPDSFFRP LLSCAAINSQ TNFAHWCNPE
FDSVLRKALS SQQLASRIEA YEEAQNILEK ELPILPLASS LRLQAYRYDI KGLVLSPFGN
ASFAGVSREK HEEVKKP