Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SNSL254_A4401 |
Symbol | |
ID | 6483466 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | - |
Start bp | 4273902 |
End bp | 4274861 |
Gene Length | 960 bp |
Protein Length | 319 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 642739640 |
Product | putative sugar-binding domain protein |
Protein accession | YP_002043334 |
Protein GI | 194442565 |
COG category | [K] Transcription |
COG ID | [COG2390] Transcriptional regulator, contains sigma factor-related N-terminal domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 74 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGATA ATACGTTGGT ATCTGATTAT GGAATGTGCG AAGAAGAGCA GGTGGCGCGT ATTGCCTGGT TCTACTATCA CGATGGATTG ACGCAGAGTG AAATCAGCGA GCGTCTGGGG CTAACCCGGC TAAAGGTTTC TCGTCTGCTG GAGAAAGGGC ATCAGTCCGG TATTATTCGC GTACAAATCA ACTCCCGCTT CGAAGGGTGT CTTGAGTATG AAAATGCCTT GCGCAACCAC TTCGCATTGC AGAATATCCG CGTGCTGCCG GCATTACCCG ATGCCGATAT TGGTCTGCGC TTAGGAATCG GCGCCGCCCA TATGCTGATG GAGTCACTGC GGCCACAGCA ACTGCTGGCC GTCGGCTTTG GCGAAGCCAC GATGACCACA TTAAAACGCC TCAGCGGATT TATCTCGGCG CAACAAATCC GACTGGTCAC GTTATCCGGC GGCGTGGGGC CGTATATGAC CGGAATAGGC CAGCTTGATG CCGCTTGTAG CGTAAGTATT ATGCCCGCGC CGCTGCGCGC ATCATCGCAG GAAATTGCCT GCACGCTGCG CAATGAAAAT AGCGTGCGGG ATGTGATGCT CACAGCGCAA GCTGCCGATG CCGCCATCGT GGGGATTGGG GCAATTAACC AGAAAGATCA AGCCAGTATC TTAAAATCCG GCTATATCAC TCAGGGTGAA CAACTCATGA TTGGCCGCAA AGGCGCAGTA GGCGATATTC TGGGCTATTT TTTTGATGCT CATGGCGAAA TTATTCCAGA CATCAAAATC CATAACGAAT TAATTGGCCT GAAGTTAAAT TCACTTTCCA CGATCCCAAC CGTGATTGGC GTCGCCGGCG GCGAACAAAA AGCAGAAGCT ATTATTGCCG CTATGCGCGG TAACTATATC AATGCGCTGG TTACCGATCA GAAAACCGCA GGGAAAATAA TTCAACTTAT TGAAAAATAA
|
Protein sequence | MSDNTLVSDY GMCEEEQVAR IAWFYYHDGL TQSEISERLG LTRLKVSRLL EKGHQSGIIR VQINSRFEGC LEYENALRNH FALQNIRVLP ALPDADIGLR LGIGAAHMLM ESLRPQQLLA VGFGEATMTT LKRLSGFISA QQIRLVTLSG GVGPYMTGIG QLDAACSVSI MPAPLRASSQ EIACTLRNEN SVRDVMLTAQ AADAAIVGIG AINQKDQASI LKSGYITQGE QLMIGRKGAV GDILGYFFDA HGEIIPDIKI HNELIGLKLN SLSTIPTVIG VAGGEQKAEA IIAAMRGNYI NALVTDQKTA GKIIQLIEK
|
| |