Gene SNSL254_A1019 details

Gene Information

Locus tag: SNSL254_A1019
Symbol:
ID: 6486435
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Newport str. SL254
Kingdom: Bacteria
Replicon accession: NC_011080
Strand:
Start bp: 1033680
End bp: 1034912
Gene Length: 1233 bp
Protein Length: 410 aa
Translation table: 11
GC content: 56%
IMG OID: 642736425
Product: hypothetical protein
Protein accession: YP_002040184
Protein GI: 194444733
COG category: [S] Function unknown
COG ID: [COG3214] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
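
The coordinate and length fields in this record are mutually consistent, which a short check makes explicit. This is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends with a single stop codon; the variable names are illustrative and not part of the database record.

```python
# Values copied from the record above.
start_bp = 1033680
end_bp = 1034912

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp)     # 1233
print(protein_length_aa)  # 410
```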


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 66
Fosmid unclonability p-value: 0.871139
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCATTGC CGTACCTTTC TCTTTCCCAG GCCCGTTGTC TTCACCTTGC TGCGCAGGGG 
CTATTGAAAA AGCCGCGCCG TAACGCGATG CCTGGCGATG TTCTTGCCGC CATCTCACGC
ATGGCGTTGC TGCAAATTGA TACCATCAAT GTTGTCGCAC GTAGCCCCTA TCTGGTGCTG
TTTAGCCGTC TCGGCTCGTA CCCGCAGGCC TGGCTGGATG AGGCGCTGCG ACGCGGCGAG
TTAATGGAAT ACTGGGCGCA TGAGGCCTGT TTCTTACCAC GCCGTGACTT TAAACTTATC
CGCCATCGTA TGCTGTCGCC GGAAAAGATG GGCTGGAAAT ATCGCGCGGC ATGGATGCAT
GAGCACGCGG AAGAAATAGA ACAGCTAATG CGGCATATTC AGGAGCACGG CCCGGTGCGA
TCTGCCGATT TTGAACATGC GCAGAAAGGC GCCAGCGGCT GGTGGGAATG GAAACCACAT
AAACGCCACC TTGAGGGTTT ATTTACCGCC GGAAAAGTCA TGGTTGTTGA GCGGCGTAAT
TTTCAACGTG TATATGATTT AACGCGCCGT GTGATGCCGC ACTGGGATGA TGAACGCGAT
GGACTGTCAC AGCCGCAGGC GGAAAGCCTG ATGCTGGATA ATAGCGCGCG CAGTCTGGGG
ATTTTCCGTG AACAGTGGCT GGCGGATTAC TACCGCCTGA AACGTCCTGA CCTGAAGGGA
TGGCGGGAGA GCCGGGCGGA ACAGCAGCAG ATTATTCCGG TCGAGGTGGA AACGTTGGGG
CGGATGTGGC TTCATGCCGA TCTTCTTTCG CAGCTTGAAC CGGCGCTAAA TAACGCCTTA
AAGGCGACCC ATAGCGCAGT ACTGTCGCCT TTCGATCCTG TGGTATGGGA TCGCAAGCGG
GCAGCGCAGC TCTTCGCATT TAACTATCGG CTGGAATGTT ATACGCCCGC GGCGAAGCGC
CAGTACGGTT ATTTTGTGCT GCCGCTATTA TACCAGGGCC GTTTAGTCGG GCGAATGGAT
GCCAAAATGC ACCGTAAAAC GGGGGTGCTT GAGGTTATCT CGCTGTATCT GGAGGACGAT
ATCCGCCCTG GCGTTAGTCT GCAAAAAGGA ATCTGGCAGG CGATTAGCGC GTTTGCTGCC
TGGCAACGGG CATCGCGCGT GACGCTGGGA CAATGTCCGC CAGGCCTGTT TAGCGCCATG
CGTCATGGCT GGGAAATAGA CCCTGCACCA TAA
 
Protein sequence
MSLPYLSLSQ ARCLHLAAQG LLKKPRRNAM PGDVLAAISR MALLQIDTIN VVARSPYLVL 
FSRLGSYPQA WLDEALRRGE LMEYWAHEAC FLPRRDFKLI RHRMLSPEKM GWKYRAAWMH
EHAEEIEQLM RHIQEHGPVR SADFEHAQKG ASGWWEWKPH KRHLEGLFTA GKVMVVERRN
FQRVYDLTRR VMPHWDDERD GLSQPQAESL MLDNSARSLG IFREQWLADY YRLKRPDLKG
WRESRAEQQQ IIPVEVETLG RMWLHADLLS QLEPALNNAL KATHSAVLSP FDPVVWDRKR
AAQLFAFNYR LECYTPAAKR QYGYFVLPLL YQGRLVGRMD AKMHRKTGVL EVISLYLEDD
IRPGVSLQKG IWQAISAFAA WQRASRVTLG QCPPGLFSAM RHGWEIDPAP
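
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. The sketch below is not the database's own pipeline. It builds the codon table from the standard NCBI amino-acid string, which translation table 11 shares with table 1 for codon-to-residue assignments (table 11 differs in its set of permitted start codons). The helper names `clean`, `translate`, and `gc_content` are hypothetical, and only the first and last blocks of the gene sequence are used as input.

```python
# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {}
for i, b1 in enumerate(BASES):
    for j, b2 in enumerate(BASES):
        for k, b3 in enumerate(BASES):
            CODON_TABLE[b1 + b2 + b3] = AMINO_ACIDS[16 * i + 4 * j + k]


def clean(seq):
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate codon by codon in frame 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_content(seq):
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 and last 33 nucleotides of the gene sequence listed above.
first_block = "ATGTCATTGC CGTACCTTTC TCTTTCCCAG"
last_block = "CGTCATGGCT GGGAAATAGA CCCTGCACCA TAA"

print(translate(first_block))  # MSLPYLSLSQ
print(translate(last_block))   # RHGWEIDPAP
```

The two printed fragments match the first and last ten residues of the protein sequence above. Passing the complete gene sequence to `translate` and `gc_content` would allow the full protein and the GC content field to be checked in the same way.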