Gene SNSL254_A3142 details


Gene Information

Locus tag: SNSL254_A3142
Symbol:
ID: 6483459
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Newport str. SL254
Kingdom: Bacteria
Replicon accession: NC_011080
Strand:
Start bp: 3053798
End bp: 3054844
Gene Length: 1047 bp
Protein Length: 348 aa
Translation table: 11
GC content: 52%
IMG OID: 642738453
Product: alkaline phosphatase isozyme conversion aminopeptidase
Protein accession: YP_002042177
Protein GI: 194443359
COG category: [R] General function prediction only
COG ID: [COG2234] Predicted aminopeptidases
TIGRFAM ID:
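
The coordinates and lengths in this table can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The helper names `span_length` and `coding_length_aa` are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table.
start_bp = 3053798
end_bp = 3054844
gene_length_bp = 1047
protein_length_aa = 348


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_length_aa(gene_length):
    """Residues encoded by a CDS, assuming the last codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Both comparisons should hold if the assumptions above are right.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(coding_length_aa(gene_length_bp) == protein_length_aa)
```

Under these assumptions, 3054844 - 3053798 + 1 gives the listed 1047 bp, and 1047 / 3 - 1 gives the listed 348 aa.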


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 64
Fosmid unclonability p-value: 0.634375
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTTTCCG CAACGCGCCG CTTTGCCGTC ATTCTGGCGC TCGGCGTAGG CTTTATCCTT 
CCTGCACAAG CAGCATCACC AGGGCCTGGT GAAATAGCGA ATACTCAGGC ACGACATATC
GCCACCTTTT TTCCCGGGAG AATGACGGGC TCCCCCGCCG AGATGTTGTC TGCCGATTAT
TTACGCCAAC AATTTACCCA GATGGGATAC CAAAGCGATA TTCGAACGTT TAATAGCCGA
TTTATTTATA CCACGAAGGA TAATCGCAAA AACTGGCATA ACGTGACGGG CAGCACGGTC
ATCGCCGCCC ATGAAGGGCG CGTGCCGCAA CAGATCATCA TTATGGCGCA TCTGGATACG
TACGCTCCGC AGAGCGACGC TGATGTCGAT GCCAATCTGG GCGGTTTAAC GTTACAGGGA
ATGGATGATA ATGCCGCGGG ATTAGGCGTT ATGCTGGAAC TGGCGGCGCG TCTGAAAGAT
ATACCGACCC ATTATGGGAT TCGTTTTATC GCCACCAGCG GGGAAGAAGA GGGAAAGCTA
GGCGCGGAAA ATTTACTCAA ACGAATGAGT GACGCTGAGA AGAAAAATAC GCTGCTGGTG
ATTAATCTCG ATAACCTGAT TGTTGGCGAC AAGCTCTATT TTAATAGCGG GAAAAATACG
CCGGAAGCGG TGCGTACACT GACCCGCGAT CGAGCATTAG CGATTGCGCG CCGTTATGGT
ATCGCCGCCA ACACCAATCC GGGACGCAAT CCATCCTACC CCAAAGGAAC GGGTTGCTGT
AATGATGCGG AGGTTTTCGA TAAAGCGGGA ATATCGGTGC TTTCTGTTGA GGCGACGAAC
TGGAATCTGG GTAAAAAAGA CGGATACCAG CAACGCGTGA AAAATGCATC CTTCCCGAAC
GGCAATAGCT GGCACGACGT ACGGCTTGAT AATCAACAGC ATATTGACAA GGCGCTGCCT
GGGCGAATTG AGCGCCGTAG CCGCGATGTA GTGCGGATAA TGCTGCCGTT GGTAAAAGAG
CTGGCGAAGG CGGAAAAAAC GTCCTGA
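
The GC content reported in the Gene Information section can be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence contains only A, C, G and T and that the spaces between 10-base blocks are layout only. The function name `gc_content` is hypothetical. Only the first line of the sequence is pasted here as a sample, so the printed value describes that fragment, not the whole gene; paste the full sequence to compare against the listed figure.

```python
def gc_content(dna):
    """Percentage of G and C bases in a DNA string, ignoring whitespace."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence only (sample input, not the full gene).
sample = "ATGTTTTCCG CAACGCGCCG CTTTGCCGTC ATTCTGGCGC TCGGCGTAGG CTTTATCCTT"
print(round(gc_content(sample), 1))
```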
 
Protein sequence
MFSATRRFAV ILALGVGFIL PAQAASPGPG EIANTQARHI ATFFPGRMTG SPAEMLSADY 
LRQQFTQMGY QSDIRTFNSR FIYTTKDNRK NWHNVTGSTV IAAHEGRVPQ QIIIMAHLDT
YAPQSDADVD ANLGGLTLQG MDDNAAGLGV MLELAARLKD IPTHYGIRFI ATSGEEEGKL
GAENLLKRMS DAEKKNTLLV INLDNLIVGD KLYFNSGKNT PEAVRTLTRD RALAIARRYG
IAANTNPGRN PSYPKGTGCC NDAEVFDKAG ISVLSVEATN WNLGKKDGYQ QRVKNASFPN
GNSWHDVRLD NQQHIDKALP GRIERRSRDV VRIMLPLVKE LAKAEKTS
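
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is a hedged illustration rather than the database's own procedure. It builds the standard codon-to-amino-acid mapping, which translation table 11 shares with the standard code for elongation codons (table 11 differs in which codons may act as start codons, which this sketch does not model). The names `CODON_TABLE` and `translate` are hypothetical, and only the first and last lines of the gene sequence are used as sample input.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna):
    """Translate DNA in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence (both happen to start in frame).
first_line = "ATGTTTTCCG CAACGCGCCG CTTTGCCGTC ATTCTGGCGC TCGGCGTAGG CTTTATCCTT"
last_line = "CTGGCGAAGG CGGAAAAAAC GTCCTGA"
print(translate(first_line))
print(translate(last_line))
```

The two printed fragments should correspond to the beginning and end of the protein sequence listed above; running the full gene sequence through `translate` gives the complete comparison.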