Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SNSL254_A1019 |
Symbol | |
ID | 6486435 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | + |
Start bp | 1033680 |
End bp | 1034912 |
Gene Length | 1233 bp |
Protein Length | 410 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 642736425 |
Product | hypothetical protein |
Protein accession | YP_002040184 |
Protein GI | 194444733 |
COG category | [S] Function unknown |
COG ID | [COG3214] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 66 |
Fosmid unclonability p-value | 0.871139 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCATTGC CGTACCTTTC TCTTTCCCAG GCCCGTTGTC TTCACCTTGC TGCGCAGGGG CTATTGAAAA AGCCGCGCCG TAACGCGATG CCTGGCGATG TTCTTGCCGC CATCTCACGC ATGGCGTTGC TGCAAATTGA TACCATCAAT GTTGTCGCAC GTAGCCCCTA TCTGGTGCTG TTTAGCCGTC TCGGCTCGTA CCCGCAGGCC TGGCTGGATG AGGCGCTGCG ACGCGGCGAG TTAATGGAAT ACTGGGCGCA TGAGGCCTGT TTCTTACCAC GCCGTGACTT TAAACTTATC CGCCATCGTA TGCTGTCGCC GGAAAAGATG GGCTGGAAAT ATCGCGCGGC ATGGATGCAT GAGCACGCGG AAGAAATAGA ACAGCTAATG CGGCATATTC AGGAGCACGG CCCGGTGCGA TCTGCCGATT TTGAACATGC GCAGAAAGGC GCCAGCGGCT GGTGGGAATG GAAACCACAT AAACGCCACC TTGAGGGTTT ATTTACCGCC GGAAAAGTCA TGGTTGTTGA GCGGCGTAAT TTTCAACGTG TATATGATTT AACGCGCCGT GTGATGCCGC ACTGGGATGA TGAACGCGAT GGACTGTCAC AGCCGCAGGC GGAAAGCCTG ATGCTGGATA ATAGCGCGCG CAGTCTGGGG ATTTTCCGTG AACAGTGGCT GGCGGATTAC TACCGCCTGA AACGTCCTGA CCTGAAGGGA TGGCGGGAGA GCCGGGCGGA ACAGCAGCAG ATTATTCCGG TCGAGGTGGA AACGTTGGGG CGGATGTGGC TTCATGCCGA TCTTCTTTCG CAGCTTGAAC CGGCGCTAAA TAACGCCTTA AAGGCGACCC ATAGCGCAGT ACTGTCGCCT TTCGATCCTG TGGTATGGGA TCGCAAGCGG GCAGCGCAGC TCTTCGCATT TAACTATCGG CTGGAATGTT ATACGCCCGC GGCGAAGCGC CAGTACGGTT ATTTTGTGCT GCCGCTATTA TACCAGGGCC GTTTAGTCGG GCGAATGGAT GCCAAAATGC ACCGTAAAAC GGGGGTGCTT GAGGTTATCT CGCTGTATCT GGAGGACGAT ATCCGCCCTG GCGTTAGTCT GCAAAAAGGA ATCTGGCAGG CGATTAGCGC GTTTGCTGCC TGGCAACGGG CATCGCGCGT GACGCTGGGA CAATGTCCGC CAGGCCTGTT TAGCGCCATG CGTCATGGCT GGGAAATAGA CCCTGCACCA TAA
|
Protein sequence | MSLPYLSLSQ ARCLHLAAQG LLKKPRRNAM PGDVLAAISR MALLQIDTIN VVARSPYLVL FSRLGSYPQA WLDEALRRGE LMEYWAHEAC FLPRRDFKLI RHRMLSPEKM GWKYRAAWMH EHAEEIEQLM RHIQEHGPVR SADFEHAQKG ASGWWEWKPH KRHLEGLFTA GKVMVVERRN FQRVYDLTRR VMPHWDDERD GLSQPQAESL MLDNSARSLG IFREQWLADY YRLKRPDLKG WRESRAEQQQ IIPVEVETLG RMWLHADLLS QLEPALNNAL KATHSAVLSP FDPVVWDRKR AAQLFAFNYR LECYTPAAKR QYGYFVLPLL YQGRLVGRMD AKMHRKTGVL EVISLYLEDD IRPGVSLQKG IWQAISAFAA WQRASRVTLG QCPPGLFSAM RHGWEIDPAP
|
| |