Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeHA_C4636 |
Symbol | |
ID | 6490884 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Heidelberg str. SL476 |
Kingdom | Bacteria |
Replicon accession | NC_011083 |
Strand | + |
Start bp | 4526936 |
End bp | 4528438 |
Gene Length | 1503 bp |
Protein Length | 500 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 642744704 |
Product | proline/glycine betaine transporter |
Protein accession | YP_002048281 |
Protein GI | 194448568 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | [TIGR00883] metabolite-proton symporter |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 76 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGAAAA GGAAAAAAAT AAAACCGATT ACACTGGGCG ATGTGACCAT CATTGATGAT GGTAAACTTC GCAAAGCGAT TACCGCCGCC TCGCTGGGCA ACGCGATGGA GTGGTTTGAT TTTGGTGTTT ATGGATTTGT TGCCTACGCG TTGGGTAAAG TCTTTTTCCC CGGCGCCGAT CCCAGCGTCC AGATGATTGC CGCGCTGGCC ACGTTTTCCG TTCCCTTCCT GATTCGTCCG CTCGGCGGGT TATTCTTTGG TATGCTCGGC GATAAATACG GGCGCCAGAA GATCCTGGCG ATCACGATTG TGATTATGTC GATCAGTACC TTCTGTATCG GGTTAATCCC CTCTTACGCG ACGATCGGTA TCTGGGCGCC AATACTGTTG TTGCTGTGTA AAATGGCGCA GGGCTTCTCG GTTGGCGGGG AATATACCGG CGCGTCGATC TTTGTCGCGG AATATTCGCC GGATCGTAAA CGCGGATTTA TGGGAAGCTG GCTGGATTTT GGTTCCATCG CCGGGTTCGT GCTGGGCGCG GGCGTGGTGG TCTTGATCTC AACGATTGTC GGCGAGGAGA ATTTCCTTGA GTGGGGCTGG CGTATTCCGT TCTTTATCGC CCTGCCATTG GGGATTATTG GTCTCTACTT ACGCCATGCG CTGGAGGAGA CGCCAGCGTT TCAGCAGCAC GTGGATAAAC TGGAGCAGGG CGACCGCGAA GGGTTGCAGG ATGGGCCGAA AGTCTCCTTT AAAGAAATTG CCACCAAACA CTGGCGTAGC CTGTTGTCAT GTATCGGTCT GGTGATTGCC ACCAACGTGA CCTACTACAT GCTGCTCACC TACATGCCGA GCTACCTGTC GCATAACCTG CACTATTCTG AAGATCACGG CGTGTTGATT ATCATCGCCA TTATGATCGG GATGCTGTTT GTGCAGCCGG TGATGGGGCT GCTGAGCGAC CGTTTCGGTC GACGTCCATT TGTGATTATG GGCAGCATTG CGCTGTTCGC GCTGGCGATC CCGGCCTTCA TCCTGATTAA CAGTAACGTT ATTGGCCTGA TTTTTGCCGG TTTGTTGATG CTGGCGGTGA TTCTGAACTG CTTTACCGGG GTGATGGCCT CGACATTGCC GGCGATGTTT CCGACGCATA TTCGTTACAG CGCGCTGGCG GCGGCTTTTA ATATCTCTGT ATTGATTGCC GGTCTGACGC CAACGCTGGC GGCCTGGCTG GTGGAAAGCT CGCAGGATCT GATGATGCCG GCGTATTATT TGATGGTCAT CGCGGTGATA GGCTTGATTA CCGGTATTTC CATGAAAGAG ACGGCCAATC GTCCGTTAAA AGGCGCAACG CCAGCGGCGT CGGACATCCA GGAAGCGAAG GAAATTCTGG GCGAGCATTA CGATAATATT GAGCAGAAAA TCGACGACAT CGATCAGGAA ATTGCGGAGC TGCAGGTCAA ACGTTCGCGT CTGGTACAGC AACATCCGCG TATCGATGAA TAA
|
Protein sequence | MLKRKKIKPI TLGDVTIIDD GKLRKAITAA SLGNAMEWFD FGVYGFVAYA LGKVFFPGAD PSVQMIAALA TFSVPFLIRP LGGLFFGMLG DKYGRQKILA ITIVIMSIST FCIGLIPSYA TIGIWAPILL LLCKMAQGFS VGGEYTGASI FVAEYSPDRK RGFMGSWLDF GSIAGFVLGA GVVVLISTIV GEENFLEWGW RIPFFIALPL GIIGLYLRHA LEETPAFQQH VDKLEQGDRE GLQDGPKVSF KEIATKHWRS LLSCIGLVIA TNVTYYMLLT YMPSYLSHNL HYSEDHGVLI IIAIMIGMLF VQPVMGLLSD RFGRRPFVIM GSIALFALAI PAFILINSNV IGLIFAGLLM LAVILNCFTG VMASTLPAMF PTHIRYSALA AAFNISVLIA GLTPTLAAWL VESSQDLMMP AYYLMVIAVI GLITGISMKE TANRPLKGAT PAASDIQEAK EILGEHYDNI EQKIDDIDQE IAELQVKRSR LVQQHPRIDE
|
| |