Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeHA_C1663 |
Symbol | |
ID | 6490256 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Heidelberg str. SL476 |
Kingdom | Bacteria |
Replicon accession | NC_011083 |
Strand | - |
Start bp | 1617468 |
End bp | 1618370 |
Gene Length | 903 bp |
Protein Length | 300 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 642741886 |
Product | substrate-binding region of ABC-type glycine betaine transport system |
Protein accession | YP_002045531 |
Protein GI | 194449370 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1732] Periplasmic glycine betaine/choline-binding (lipo)protein of an ABC-type transport system (osmoprotectant binding protein) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 84 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGATTCA AAAAACACCT GTTGGGATGG CTTGCCGCAA CGCTGTTGTT CAGTAGCCAG ACGCAGGCCG CGCCGCTGGT TCTCGCCACC AAAAGCTTTA CCGAGCAGCA TATTCTTTCC GCAATGACCG TTCAGTATTT GCAGAAGAAA GGCTTTCAGG TTCAGCCGCA AACCAATATC GCCGCGGTGA TTTCACGTAA TGCGATGGTG AATAAACAAA TTGATATTAC CTGGGAATAC ACCGGTACAT CGCTGATTAT TTTCAACCGT ATCGACAAGC GCATGAGTCC ACAGGAAACC TACGACACGG TAAAACGCCT GGATGCGAAG CTGGGCCTGG TATGGCTCAA ACCGGCTGAC ATGAACAATA CTTACGCGTT CGCGATGCAA CGCAAACGCG CCGAGTCGGA AAATATCACC ACCATCTCGC AAATGGTGGC AAAAATCGAA CAAGTCCGGC AGAACGATCC TGACCACAAC TGGATGCTCG GCCTCGATCT GGAATTTGCC GGGCGCAGCG ACGGGATGAA GCCCCTTCAG CAAGCCTACC AGATGCAGCT CGATCGCCCG CAAATACGAC AGATGGACCC AGGGCTGGTC TATAACGCCG TTCGGGATGG GCTGGTTGAC GCCGGGCTGG TCTATACCAC CGACGGACGG GTGAAAGGGT TTGATCTGAA AGTGCTGGAA GATGATAAAG GCTTCTTTCC AAGTTACGCT GTCACGCCCG TGGTGCGTAA AGAGGTGCTG GAAGCCAATC CTGGCCTTGA TGACGCCTTA AACACCCTTT CCGGCCTGCT CAATAACGAT GTGATATCGA CCCTAAACGC CCAGGTCGAT ATCGAGCATC GCACGCCGCA ACAGGTAGCC CATCAATTTT TGCAGGACAA AGGTCTGCTG TAA
|
Protein sequence | MRFKKHLLGW LAATLLFSSQ TQAAPLVLAT KSFTEQHILS AMTVQYLQKK GFQVQPQTNI AAVISRNAMV NKQIDITWEY TGTSLIIFNR IDKRMSPQET YDTVKRLDAK LGLVWLKPAD MNNTYAFAMQ RKRAESENIT TISQMVAKIE QVRQNDPDHN WMLGLDLEFA GRSDGMKPLQ QAYQMQLDRP QIRQMDPGLV YNAVRDGLVD AGLVYTTDGR VKGFDLKVLE DDKGFFPSYA VTPVVRKEVL EANPGLDDAL NTLSGLLNND VISTLNAQVD IEHRTPQQVA HQFLQDKGLL
|
| |