Gene Information |
Locus tag | SNSL254_A2922 |
Symbol | |
ID | 6484727 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | - |
Start bp | 2857924 |
End bp | 2859894 |
Gene Length | 1971 bp |
Protein Length | 656 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 642738239 |
Product | hypothetical protein |
Protein accession | YP_002041968 |
Protein GI | 194446116 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01760] phage tail tape measure protein, TP901 family, core region |
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0826266 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.0000000000000799042 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAACAGT TAGATTTTAC ATTAAGCCTG ATTGATAAGC TGTCCCGCCC GTTAAAACAG GCACAGAGCA GCGTCACCGG CTTTGCGGAA AAATCAAAAG CGGCCTTTAT GCAGATTGGC GGTGGTGTGC TGGCTTTAGC GGGTACAGGA ATGGCCATAC GGGGTGCGTT ATCACCGGCA ATTGAAATGT ATGATGCGCT GAATGATGCA GCATCAAAAG GGATTGATGA TCAGGCATTA AAAGCCGTAC AGCGGGATGC CCTGCGCTTC AGTACAACTT ATGGCGCCAG TGCGGTGGAA TTTGTTCAGT CCACTGAAAG TATTAATTCC GCCATTGCCG GGCTGACCGG TAATGAACTG CCGAAAGTGA CAAAAGTTGC TAATACCCTG GCGTTTGCAC TGAAATCCAC CGCCGCAGAA ACGGCGGAAT TTATGGGGCA GATGTTTGGT AATTTTTCCG CCGATGCGGA GCGTCTGGGC AAGGTTCAGT TCGCTGAACA GCTGGCCGGA AAAATGGTGT ATATGCGTAA AACGTTCGGT ACTGAAATGG CGACGATTAA GGATTTGATG GAAGGTGCAC GCGGCGTGGG GACTAACTAC GGTGTCGGAC TGGATGAACA GCTGGCGGTA CTGGGGCAAT TGAACCGCAC GCTGGGAACG GAAGCCAGCA GCGCCTATGA GGGCTTTATG ACGGGGGCAG TTGAAGGGGC AAAAAAACTG GGTCTGTCCT TTACTGACGC CACCGGAAAA ATGCTGTCCA TGCCTGAGAT GCTGATTAAA TTGCAGGGCA AATACGGCAA GAGCCTGGAA GGGAATCTGA AAGCCCAGGC GGAACTGGAT GCGGCATTCG GTGACAGTTC GGCTGTGGTC AAACACCTTT ACGGTAATGT GGCGCTTCTC CAGAGGAACA TCACCGAACT GGGCGGATCT GACGGTCTGA AACGTACGCA GGAGATGGCC AGTAAACTGG TGAAACCGTG GGATCGGTTT GTACAAATCC TGAAAGCCAT TCAGACCGTA ATAGGGCTGA CACTAATCCC GGTATTGTAT CCGGTGCTGA ATCGTCTGGC GGATATGGGA CAGACATTTG CCAGATGGAT GCAGCTATTT CCCAACATTG CCCGTGTTAT TGGCTATGCC GCTATGGCTT TGCTGGGGTT TGCGGCAGTG GGTGCGGTTG CCAATATCGT CATGGGAGTT TCAAAGTTCA TCATGATGGG TTGGAAGGGG GTATGGAAGT TACTCACAGC GGTTACCAAA ATCGATACGG CCTGGACGTG GCTTAACACA AAAGCAAAGC TGGCGTGGGC TAATGTCATG AAATCATTGC GAGGCATTCT TCTTGCACTC CGTATGCAAG CTATTATGAC AGGCACTGCC ATTAATTTTA TGAGCTGGCC GGTCTTGCTT GTGATCGGGG CGATAGCATT GCTTGCGGCG GGTTGCTGGT TGCTGATTAA ACACTGGGAT ACGGTGAAAG CAGCTGTTAT GGAAACATCC GCGTTTCAGG CGTGTGCCAG GGTGGTGGCG TGGCTGGCCG GGGTGTTTTC CACAGCGTGG CAATTTATCA GTGAAGGCTG GAACAGTTTT ATTGCGCTAT TAACAGGGTT TTCACCCTCA CAGGCATTAA GTGGACTGGC GTCGGGTATT GTATCCATGT TTGATAATGT CTGGCAGTCC GTTAAAGGTG GTTTTCTGAA ATCGTGGAAC TGGATTGTTG AGAAGCTGAA TAAAATACCC GGCGTTGATA TCTCAATGGC TAATGAAACC TCTTCGCCAC CATTAACAGT AAATAATTTA 
TCTACAGGTG GTGAGCTAAA AGGAATTGAT AAAGGTGGTA TCAGTAAATC TGTCAGTAAT AACTCAAGGT CTGTGACGGA TAACAGCCGG AAAATTAATA CTGTCAATAT CTATCCAAAA GAAATGATAA CGCCGGGGCA GTTAATGGAG TTTCAGGAGC TGGGCGTATG A
|
Protein sequence | MKQLDFTLSL IDKLSRPLKQ AQSSVTGFAE KSKAAFMQIG GGVLALAGTG MAIRGALSPA IEMYDALNDA ASKGIDDQAL KAVQRDALRF STTYGASAVE FVQSTESINS AIAGLTGNEL PKVTKVANTL AFALKSTAAE TAEFMGQMFG NFSADAERLG KVQFAEQLAG KMVYMRKTFG TEMATIKDLM EGARGVGTNY GVGLDEQLAV LGQLNRTLGT EASSAYEGFM TGAVEGAKKL GLSFTDATGK MLSMPEMLIK LQGKYGKSLE GNLKAQAELD AAFGDSSAVV KHLYGNVALL QRNITELGGS DGLKRTQEMA SKLVKPWDRF VQILKAIQTV IGLTLIPVLY PVLNRLADMG QTFARWMQLF PNIARVIGYA AMALLGFAAV GAVANIVMGV SKFIMMGWKG VWKLLTAVTK IDTAWTWLNT KAKLAWANVM KSLRGILLAL RMQAIMTGTA INFMSWPVLL VIGAIALLAA GCWLLIKHWD TVKAAVMETS AFQACARVVA WLAGVFSTAW QFISEGWNSF IALLTGFSPS QALSGLASGI VSMFDNVWQS VKGGFLKSWN WIVEKLNKIP GVDISMANET SSPPLTVNNL STGGELKGID KGGISKSVSN NSRSVTDNSR KINTVNIYPK EMITPGQLME FQELGV
|
| |
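The numeric fields in this record can be cross-checked against one another. The following is a minimal sketch, not part of the source database: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS is unspliced and ends in a single stop codon (the record lists "Is gene spliced | No", and the gene sequence shown ends in TGA). The GC helper ignores the spaces used to group the sequence into blocks of ten, so the Gene sequence field could be pasted in as-is.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


def gc_percent(sequence: str) -> float:
    """Percent G+C, ignoring whitespace and any non-ACGT characters."""
    bases = [b for b in sequence.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotides found")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# Values copied from the record above.
START_BP = 2857924
END_BP = 2859894

if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print("Gene length (bp):", length)
    print("Protein length (aa):", protein_length(length))
    # First block of the gene sequence only, as a small demonstration.
    print("GC% of first 10 bases:", gc_percent("ATGAAACAGT"))
```

Under these assumptions the coordinates give 1971 bp and 656 aa, matching the Gene Length and Protein Length fields. Checking the 49% GC content would require passing the full Gene sequence field to `gc_percent`.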