Gene Information |
Locus tag | SNSL254_A1604 |
Symbol | |
ID | 6484928 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | - |
Start bp | 1569790 |
End bp | 1570692 |
Gene Length | 903 bp |
Protein Length | 300 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 642736990 |
Product | substrate-binding region of ABC-type glycine betaine transport system |
Protein accession | YP_002040742 |
Protein GI | 194446205 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1732] Periplasmic glycine betaine/choline-binding (lipo)protein of an ABC-type transport system (osmoprotectant binding protein) |
TIGRFAM ID | |
|
|
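The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based inclusive coordinates and that the reported gene length includes the stop codon while the protein length does not. The function names are hypothetical and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
print(gene_length(1569790, 1570692))  # 903
print(protein_length(903))            # 300
```

Under these assumptions, both results agree with the Gene Length and Protein Length fields in the record.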
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 82 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGATTCA AAAAACACCT GTTGGGATGG CTTGCCGCAA CGCTGTTGTT CAGTAGCCAG ACGCAGGCCG CGCCGCTGGT TCTCGCCACC AAAAGCTTTA CCGAGCAGCA TATTCTTTCC GCTATGACCG TTCAGTATTT GCAGAAGAAA GGCTTTCAGG TTCAGCCGCA AACCAATATC GCCGCGGTAA TTTCACGTAA TGCGATGGTG AATAAACAAA TTGATATTAC CTGGGAATAC ACCGGTACAT CGCTGATTAT TTTCAACCGT ATCGATAAGC GCATGAGTCC ACAGGAAACC TACGACACGG TAAAACGCCT GGATGCGAAG CTGGGCCTGG TATGGCTCAA ACCGGCTGAC ATGAACAATA CTTACGCGTT CGCGATGCAG CGCAAACGCG CCGAGTCGGA AAATATCACC ACCATCTCGC AAATGGTGGC AAAAATCGAA CAGGTCCGGC AGAACGATCC TGACCACAAC TGGATGCTCG GTCTCGATCT GGAATTTGCC GGGCGCAGCG ATGGGATGAA GCCCCTTCAG CAAGCCTACC AAATGCAGCT TGATCGCCCG CAAATACGAC AGATGGACCC AGGGCTGGTC TATAACGCCG TTCGGGATGG GCTGGTTGAC GCCGGGCTGG TCTATACCAC CGACGGACGG GTGAAAGGGT TTGATCTGAA AGTGCTGGAA GATGATAAAG GCTTCTTTCC AAGTTACGCT GTCACGCCCG TGGTGCGTAA AGAGGTGCTG GAAGCCAATC CTGGCCTTGA TGACGCCTTA AACACCCTTT CCGGCCTGCT CAATAACGAT GTGATATCGA CCCTAAACGC CCAAGTCGAT ATCGAGCATC GCACGCCGCA ACAGGTAGCC CATCAATTTT TGCAGGACAA AGGTCTGCTG TAA
|
Protein sequence | MRFKKHLLGW LAATLLFSSQ TQAAPLVLAT KSFTEQHILS AMTVQYLQKK GFQVQPQTNI AAVISRNAMV NKQIDITWEY TGTSLIIFNR IDKRMSPQET YDTVKRLDAK LGLVWLKPAD MNNTYAFAMQ RKRAESENIT TISQMVAKIE QVRQNDPDHN WMLGLDLEFA GRSDGMKPLQ QAYQMQLDRP QIRQMDPGLV YNAVRDGLVD AGLVYTTDGR VKGFDLKVLE DDKGFFPSYA VTPVVRKEVL EANPGLDDAL NTLSGLLNND VISTLNAQVD IEHRTPQQVA HQFLQDKGLL
|
| |
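The sequence fields can be checked against the Gene Information fields with a short script. This is a sketch only: the database does not state how it computes GC content, so a plain G+C fraction is assumed here, and the helper names (clean, gc_content, translate) are hypothetical. The gene sequence is shown starting with ATG and ending with TAA, so it is treated as already being in coding orientation even though the gene lies on the minus strand. Translation table 11 assigns the same amino acids to codons as the standard code (it differs in permitted start codons), so the standard codon table is used.

```python
from itertools import product

# Standard codon-to-amino-acid mapping, in TCAG order (also valid for table 11).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-character blocks and uppercase the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases (assumed definition of the GC content field)."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons of the gene sequence above, plus a stop codon for illustration.
print(translate("ATGAGATTCA AAAAACACCT GTAA"))  # MRFKKHL
```

Passing the full Gene sequence value to translate and gc_content allows a comparison with the Protein sequence and GC content fields in the record.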