Gene SNSL254_A4045 details


Gene Information

Locus tag: SNSL254_A4045
Symbol:
ID: 6486155
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Newport str. SL254
Kingdom: Bacteria
Replicon accession: NC_011080
Strand:
Start bp: 3936008
End bp: 3936937
Gene Length: 930 bp
Protein Length: 309 aa
Translation table: 11
GC content: 50%
IMG OID: 642739303
Product: hypothetical protein
Protein accession: YP_002043012
Protein GI: 194446756
COG category: [S] Function unknown
COG ID: [COG5464] Uncharacterized conserved protein
TIGRFAM ID: [TIGR01784] conserved hypothetical protein (putative transposase or invertase)
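
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state its coordinate convention) and that the coding sequence ends in a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 3936008
end_bp = 3936937

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and contributes no amino acid.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.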


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.673826
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 52
Fosmid unclonability p-value: 0.0374243
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAAAA GCCCCACGTC CACGCCTCAT GATGCGGTAT TCAAAACGTT TTTACGCCAT 
CCGGATACCG CGCGGGATTT TCTCAATATT CATCTTCCCC ATTCGCTAAG AATACGTTGC
GATCTGACGA CGTTAAAACT GGCGCCGGAC AGTTTTATCG AGAAAAATTT ACGCGCGTTT
TATTCCGACG TCCTTTGGTC GCTAAAAACG TGTGAAGGCG ATGGTTATAT CTATGTCGTT
ATAGAGCATC AGAGTACGCC GGACGCGCAT ATGGCGTTCC GGTTAATGCG TTACGCGACT
GCCGCGATGC AGCGCCATCT GGATGCTGGC CATAAAACGT TACCGCTGGT GATTCCCATG
CTGTTTTACC ATGGCGCGAA AAGCCCGTAT CCCTTTTCGC TTTGCTGGCT GGATGAGTTT
GACGATCCTG CACTGGCGCG TCAGCTTTAT GCGACGGCAT TTCCACTGGT AGACATTACG
GTGGTGCCGG ATAACGAGAT TATGCAGCAT CGACGTATCG CGATGCTGGA ACTGGTACAA
AAGCATATAC GTCAGCGCGA CCTGATGGGA TTGGTCGAGC GTTTAGCGGT ACTTCTGATT
ACGGGAAACG CTAATGACAG TCAGCTAAAA GCGCTGTTTA ATTATTTGCT AATACAGCAT
GGCAGCACGC CTCGTTTTGG CAAGTTTATC CGCGAGGTGG CGCGTCGTGT TCCCCAACAC
AAGGAGAGAT TAATGACGAT CGTAGACAGA ATACGTGAAT CGGGGCGCAG AAAAGGTAAG
CGTGAAGGCG TGCAACAAGG TATACATCAA GGTAAGCAGG AGGAAGCCTT GCGTATTGCG
CATACGATGC TGGAACAGGG GATCAATCGA GAGATGGTGC TGATGATTAC CGGGCTTTCT
GACGAAGAGA TTAAGGCAAA GCGCCATTAA
 
Protein sequence
MKKSPTSTPH DAVFKTFLRH PDTARDFLNI HLPHSLRIRC DLTTLKLAPD SFIEKNLRAF 
YSDVLWSLKT CEGDGYIYVV IEHQSTPDAH MAFRLMRYAT AAMQRHLDAG HKTLPLVIPM
LFYHGAKSPY PFSLCWLDEF DDPALARQLY ATAFPLVDIT VVPDNEIMQH RRIAMLELVQ
KHIRQRDLMG LVERLAVLLI TGNANDSQLK ALFNYLLIQH GSTPRFGKFI REVARRVPQH
KERLMTIVDR IRESGRRKGK REGVQQGIHQ GKQEEALRIA HTMLEQGINR EMVLMITGLS
DEEIKAKRH
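
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a minimal sketch, not the method used to produce the record: the `translate` helper and the `CODON_TABLE` name are hypothetical, and the codon-to-amino-acid assignments are those of the standard genetic code, which translation table 11 shares for elongation (table 11 differs only in the set of permitted start codons, which does not matter here because the gene begins with ATG).

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in the conventional T, C, A, G ordering
# ("*" marks a stop codon).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    # Remove the spaces and line breaks used to group the sequence in blocks of 10.
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
print(translate("ATGAAAAAAA GCCCCACGTC CACGCCTCAT"))  # MKKSPTSTPH
```

Passing the final line of the gene sequence (`GACGAAGAGA TTAAGGCAAA GCGCCATTAA`) to the same function yields `DEEIKAKRH`, the final line of the protein sequence, with the trailing `TAA` read as the stop codon.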