Gene Information |
Locus tag | SeHA_C1878 |
Symbol | |
ID | 6489253 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Heidelberg str. SL476 |
Kingdom | Bacteria |
Replicon accession | NC_011083 |
Strand | + |
Start bp | 1831467 |
End bp | 1833140 |
Gene Length | 1674 bp |
Protein Length | 557 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 642742091 |
Product | peptide transport periplasmic protein SapA |
Protein accession | YP_002045736 |
Protein GI | 194448736 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG4166] ABC-type oligopeptide transport system, periplasmic component |
TIGRFAM ID | |
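The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the reported gene length includes the stop codon; the function names `gene_length` and `protein_length` are illustrative and not part of any database API.

```python
# Values copied from the record above.
START_BP = 1831467
END_BP = 1833140


def gene_length(start, end):
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length(gene_len_bp, includes_stop=True):
    """Residue count for an unspliced CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default,
    that the final codon is a stop codon that is not translated.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if includes_stop else codons


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1674
print(protein_length(length_bp))  # 557
```

Under these assumptions the result agrees with the Gene Length (1674 bp) and Protein Length (557 aa) fields in the record.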
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 52 |
Fosmid unclonability p-value | 0.00804887 |
Fosmid hitchhiking | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | TTGAACTTCA AAAACTTAAC TATTATGCGC CTGGTTTTAT CGTCTCTGAT CGTGATAGCG GGTCTACTGA GTAGTCAGGC TACGGCTGCG ACTACACCCG AACAAACTGC GAGTGCAGAT ATTCGCGATA GCGGCTTTGT GTATTGTGTC AGCGGGCAGG TCAACACCTT TAATCCGCAA AAAGCGAGTA GCGGCCTCAT CGTCGATACC CTGGCCGCCC AGTTATATGA CCGTCTGTTG GATGTCGATC CCTATACTTA TCGTTTAGTC CCAGAGCTGG CAGAAAGCTG GGAAGTGCTG GATAACGGGG CAACGTACCG TTTTCACCTG CGCCGCGACG TTTCCTTTCA AAAAACCGCC TGGTTTACGC CGACCCGAAA ACTCAATGCT GATGATGTCG TCTTTACCTT TCAGCGGATT TTCGATCGTC GGCATCCGTG GCATAACATC AACGGCAGTA GCTTCCCCTA CTTTGATAGC CTACAGTTCG CCGACAATGT AAAAAGCGTG CGTAAGCTGG ACAATAACAC CGTTGAGTTT CGCCTGACGC AGCCAGACGC CTCCTTTTTA TGGCATCTGG CCACACACTA CGCTTCCGTC ATGTCCGCTG AGTACGCCGC GCAGCTTAGC CGAAAAGATC GTCAGGAACT GCTAGACCGC CAACCGGTCG GCACCGGGCC TTTCCAGCTT TCGGAGTACC GTGCCGGACA GTTTATTCGT CTCCAGCGCC ACGATGGGTT TTGGCGCGGC AAACCGCTGA TGCCGCAAGT GGTAGTGGAT TTAGGCTCCG GCGGTACCGG GCGTTTATCG AAATTACTGA CCGGTGAATG CGATGTTCTG GCCTGGCCCG CCGCCAGCCA GCTAACTATT TTACGCGACG ATCCGCGTTT GCGCCTGACG TTGCGCCCGG GGATGAATAT CGCCTATCTG GCCTTTAACA CCGATAAGCC GCCGTTGAAT AATCCCGCAG TGCGCCATGC GCTGGCCTTA TCGATCAACA ACCAGCGTCT GATGCAGTCG ATTTATTACG GCACGGCGGA AACCGCAGCC TCCATTTTAC CGAGAGCCTC ATGGGCTTAC GATAACGATG CCAAAATTAC GGAGTACAAT CCGCAAAAAT CGCGCGAACA GCTAAAAGCG CTGGGCATTG AGAATCTTAC GCTGCATCTC TGGGTGCCGA CCAGTTCTCA GGCCTGGAAC CCAAGTCCGC TAAAAACGGC GGAGCTTATT CAGGCGGATA TGGCGCAGGT TGGCGTAAAA GTGGTCATTG TGCCGGTTGA AGGTCGTTTT CAGGAGGCGC GCCTGATGGA TATGAATCAC GATCTGACCT TATCCGGCTG GGCCACGGAC AGCAACGATC CGGATAGCTT TTTCAGACCA CTGTTAAGCT GTGCGGCCAT CAATTCGCAA ACCAATTTCG CCCACTGGTG TAACCCTGAA TTTGACAGCG TGCTGCGTAA AGCACTGTCG TCGCAGCAGT TGGCTTCGCG CATAGAAGCA TATGAGGAAG CGCAGAATAT CCTGGAGAAA GAGCTGCCGA TACTGCCGCT GGCATCATCA CTACGCCTGC AGGCTTACCG CTACGATATT AAAGGGCTGG TGTTAAGCCC GTTCGGCAAT GCGTCTTTTG CCGGCGTCTC CCGCGAAAAA CACGAAGAGG TGAAAAAACC ATGA
|
Protein sequence | MNFKNLTIMR LVLSSLIVIA GLLSSQATAA TTPEQTASAD IRDSGFVYCV SGQVNTFNPQ KASSGLIVDT LAAQLYDRLL DVDPYTYRLV PELAESWEVL DNGATYRFHL RRDVSFQKTA WFTPTRKLNA DDVVFTFQRI FDRRHPWHNI NGSSFPYFDS LQFADNVKSV RKLDNNTVEF RLTQPDASFL WHLATHYASV MSAEYAAQLS RKDRQELLDR QPVGTGPFQL SEYRAGQFIR LQRHDGFWRG KPLMPQVVVD LGSGGTGRLS KLLTGECDVL AWPAASQLTI LRDDPRLRLT LRPGMNIAYL AFNTDKPPLN NPAVRHALAL SINNQRLMQS IYYGTAETAA SILPRASWAY DNDAKITEYN PQKSREQLKA LGIENLTLHL WVPTSSQAWN PSPLKTAELI QADMAQVGVK VVIVPVEGRF QEARLMDMNH DLTLSGWATD SNDPDSFFRP LLSCAAINSQ TNFAHWCNPE FDSVLRKALS SQQLASRIEA YEEAQNILEK ELPILPLASS LRLQAYRYDI KGLVLSPFGN ASFAGVSREK HEEVKKP
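The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup plus a GC-content calculation; `clean_sequence` and `gc_percent` are hypothetical helper names, and the short input string is just the first three blocks of the gene sequence, not the full record.

```python
def clean_sequence(text):
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First three blocks of the gene sequence from the record (illustrative only).
fragment = clean_sequence("TTGAACTTCA AAAACTTAAC TATTATGCGC")
print(len(fragment))         # number of bases after joining the blocks
print(gc_percent(fragment))  # GC percentage of this fragment only
```

Applying the same two functions to the full gene sequence gives a value that can be compared with the GC content field in the record.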