Gene SNSL254_A2922 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A2922 
Symbol 
ID6484727 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp2857924 
End bp2859894 
Gene Length1971 bp 
Protein Length656 aa 
Translation table11 
GC content49% 
IMG OID642738239 
Producthypothetical protein 
Protein accessionYP_002041968 
Protein GI194446116 
COG category 
COG ID 
TIGRFAM ID[TIGR01760] phage tail tape measure protein, TP901 family, core region 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0826266 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.0000000000000799042 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAACAGT TAGATTTTAC ATTAAGCCTG ATTGATAAGC TGTCCCGCCC GTTAAAACAG 
GCACAGAGCA GCGTCACCGG CTTTGCGGAA AAATCAAAAG CGGCCTTTAT GCAGATTGGC
GGTGGTGTGC TGGCTTTAGC GGGTACAGGA ATGGCCATAC GGGGTGCGTT ATCACCGGCA
ATTGAAATGT ATGATGCGCT GAATGATGCA GCATCAAAAG GGATTGATGA TCAGGCATTA
AAAGCCGTAC AGCGGGATGC CCTGCGCTTC AGTACAACTT ATGGCGCCAG TGCGGTGGAA
TTTGTTCAGT CCACTGAAAG TATTAATTCC GCCATTGCCG GGCTGACCGG TAATGAACTG
CCGAAAGTGA CAAAAGTTGC TAATACCCTG GCGTTTGCAC TGAAATCCAC CGCCGCAGAA
ACGGCGGAAT TTATGGGGCA GATGTTTGGT AATTTTTCCG CCGATGCGGA GCGTCTGGGC
AAGGTTCAGT TCGCTGAACA GCTGGCCGGA AAAATGGTGT ATATGCGTAA AACGTTCGGT
ACTGAAATGG CGACGATTAA GGATTTGATG GAAGGTGCAC GCGGCGTGGG GACTAACTAC
GGTGTCGGAC TGGATGAACA GCTGGCGGTA CTGGGGCAAT TGAACCGCAC GCTGGGAACG
GAAGCCAGCA GCGCCTATGA GGGCTTTATG ACGGGGGCAG TTGAAGGGGC AAAAAAACTG
GGTCTGTCCT TTACTGACGC CACCGGAAAA ATGCTGTCCA TGCCTGAGAT GCTGATTAAA
TTGCAGGGCA AATACGGCAA GAGCCTGGAA GGGAATCTGA AAGCCCAGGC GGAACTGGAT
GCGGCATTCG GTGACAGTTC GGCTGTGGTC AAACACCTTT ACGGTAATGT GGCGCTTCTC
CAGAGGAACA TCACCGAACT GGGCGGATCT GACGGTCTGA AACGTACGCA GGAGATGGCC
AGTAAACTGG TGAAACCGTG GGATCGGTTT GTACAAATCC TGAAAGCCAT TCAGACCGTA
ATAGGGCTGA CACTAATCCC GGTATTGTAT CCGGTGCTGA ATCGTCTGGC GGATATGGGA
CAGACATTTG CCAGATGGAT GCAGCTATTT CCCAACATTG CCCGTGTTAT TGGCTATGCC
GCTATGGCTT TGCTGGGGTT TGCGGCAGTG GGTGCGGTTG CCAATATCGT CATGGGAGTT
TCAAAGTTCA TCATGATGGG TTGGAAGGGG GTATGGAAGT TACTCACAGC GGTTACCAAA
ATCGATACGG CCTGGACGTG GCTTAACACA AAAGCAAAGC TGGCGTGGGC TAATGTCATG
AAATCATTGC GAGGCATTCT TCTTGCACTC CGTATGCAAG CTATTATGAC AGGCACTGCC
ATTAATTTTA TGAGCTGGCC GGTCTTGCTT GTGATCGGGG CGATAGCATT GCTTGCGGCG
GGTTGCTGGT TGCTGATTAA ACACTGGGAT ACGGTGAAAG CAGCTGTTAT GGAAACATCC
GCGTTTCAGG CGTGTGCCAG GGTGGTGGCG TGGCTGGCCG GGGTGTTTTC CACAGCGTGG
CAATTTATCA GTGAAGGCTG GAACAGTTTT ATTGCGCTAT TAACAGGGTT TTCACCCTCA
CAGGCATTAA GTGGACTGGC GTCGGGTATT GTATCCATGT TTGATAATGT CTGGCAGTCC
GTTAAAGGTG GTTTTCTGAA ATCGTGGAAC TGGATTGTTG AGAAGCTGAA TAAAATACCC
GGCGTTGATA TCTCAATGGC TAATGAAACC TCTTCGCCAC CATTAACAGT AAATAATTTA
TCTACAGGTG GTGAGCTAAA AGGAATTGAT AAAGGTGGTA TCAGTAAATC TGTCAGTAAT
AACTCAAGGT CTGTGACGGA TAACAGCCGG AAAATTAATA CTGTCAATAT CTATCCAAAA
GAAATGATAA CGCCGGGGCA GTTAATGGAG TTTCAGGAGC TGGGCGTATG A
 
Protein sequence
MKQLDFTLSL IDKLSRPLKQ AQSSVTGFAE KSKAAFMQIG GGVLALAGTG MAIRGALSPA 
IEMYDALNDA ASKGIDDQAL KAVQRDALRF STTYGASAVE FVQSTESINS AIAGLTGNEL
PKVTKVANTL AFALKSTAAE TAEFMGQMFG NFSADAERLG KVQFAEQLAG KMVYMRKTFG
TEMATIKDLM EGARGVGTNY GVGLDEQLAV LGQLNRTLGT EASSAYEGFM TGAVEGAKKL
GLSFTDATGK MLSMPEMLIK LQGKYGKSLE GNLKAQAELD AAFGDSSAVV KHLYGNVALL
QRNITELGGS DGLKRTQEMA SKLVKPWDRF VQILKAIQTV IGLTLIPVLY PVLNRLADMG
QTFARWMQLF PNIARVIGYA AMALLGFAAV GAVANIVMGV SKFIMMGWKG VWKLLTAVTK
IDTAWTWLNT KAKLAWANVM KSLRGILLAL RMQAIMTGTA INFMSWPVLL VIGAIALLAA
GCWLLIKHWD TVKAAVMETS AFQACARVVA WLAGVFSTAW QFISEGWNSF IALLTGFSPS
QALSGLASGI VSMFDNVWQS VKGGFLKSWN WIVEKLNKIP GVDISMANET SSPPLTVNNL
STGGELKGID KGGISKSVSN NSRSVTDNSR KINTVNIYPK EMITPGQLME FQELGV