Gene SNSL254_pSN254_0038 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_pSN254_0038 
SymbolinsA 
ID4929420 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_009140 
Strand
Start bp22905 
End bp24398 
Gene Length1494 bp 
Protein Length497 aa 
Translation table11 
GC content61% 
IMG OID642572342 
Producttransposase InsA 
Protein accessionYP_001101917 
Protein GI134047191 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones61 
Fosmid unclonability p-value0.83656 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTCATG TGGCGGCCAG GACGGCCAGC CGGGATCGGG ATACTGGTCG TTACCAGAGC 
CACCGACCCG AGCAAACCCT TCTCTATCAG ATCGTTGACG AGTATTACCC GGCATTCGCT
GCGCTTATGG CAGAGCAGGG AAAGGAATTG CCGGGCTATG TGCAACGGGA ATTTGAAGAA
TTTCTCCAAT GCGGGCGGCT GGAGCATGGC TTTCTACGGG TTCGCTGCGA GTCTTGCCAC
GCCGAGCACC TGGTCGCTTT CAGCTGTAAG CGTCGCGGTT TCTGCCCGAG CTGTGGGGCG
CGGCGGATGG CCGAAAGTGC CGCCTTGCTG GTTGATGAAG TACTGCCTGA ACAACCCATG
CGTCAGTGGG TGTTGAGCTT CCCGTTTCAG CTGCGTTTCC TGTTTGCCAG CCGGCCCGAG
ATCATGGGGT GGGTGCTGGG CATCGTTTAC CGCGTCATTG CCACGCACCT GGTCAAGAAA
GCGGGCCATA CCCACCAAGT GGCCAAGACG GGCGCGGTCA CCCTGATCCA GCGTTTTGGA
TCGGCGCTCA ATCTGAATGT TCACTTCCAC ATGCTGTTTC TCGACGGTGT GTATGTCGAG
CAATCCCACG GCTCAGCGCG TTTCCGCTGG GTCAAGGCGC CGACCAGCCC AGAGCTCACC
CAGCTGACGC ACACCATCGC CCACCGGGTG GGTCGCTATC TGGAACGGCA AGGCCTGCTG
GAACGGGATG TCGAAAACAG CTATCTGGCC TCGGATGCGG TGGATGACGA CCCGATGACA
CCCCTGCTGG GGCACTCGAT CACTTACCGT ATCGCTGTCG GTTCACAGGC GGGGCGAAAG
GTGTTCACTT TGCAAACTCT GCCGACCAGT GGTGATCCGT TCGGTGACGG GATTGGCAAG
GTAGCCGGGT CCAGCCTGCA CGCCGGCGTG GCGGCCAGGG CCGATGAACG CAAGAAGCTC
GAACGGCTGT GCCGGTACAT CAGCCGCCCG GCGGTATCCG AGAAGCGGCT GTCGTTAACA
CGAGGCGGCA ACGTGCGCTA CCAGCTCAAG ACGCCGTACC GGGACGGCAC CACGCACGTC
ATTTTCGAAC CATTGGATTT CATTGCAAGG CTGGCCGCCC TGGTACCGAA GCCCAGAGTC
AACCTAACCC GCTTCCACGG GGTGTTCGCA CCCAACAGTC GGCACCGGGC GTTGGTCACG
CCGGCAAAAC GGGGCAGGGG CAACAAGGTC AGGGTGGCTG ATGAACCGGC AACACCAGCA
CAACGGCGAG CGTCGATGAC ATGGGCGCAA CGGCTCAAGC GTGTTTTCAA TATCGACATC
GAGACCTGCA GCGGCTGCGG CGGCGCCATG AAAGTCATCG CCTGCATTGA AGACCCTATA
GTGATCAAGC AGATCCTTGA TCACCTGAAG CACAAAGCCG AAACCAGCGG GACCAGGGCG
TTACCCGAAA GCCGGGCGCC ACCGGCTGAG CTGCTCCTGG GTCTGTTTGA CTGA
 
Protein sequence
MPHVAARTAS RDRDTGRYQS HRPEQTLLYQ IVDEYYPAFA ALMAEQGKEL PGYVQREFEE 
FLQCGRLEHG FLRVRCESCH AEHLVAFSCK RRGFCPSCGA RRMAESAALL VDEVLPEQPM
RQWVLSFPFQ LRFLFASRPE IMGWVLGIVY RVIATHLVKK AGHTHQVAKT GAVTLIQRFG
SALNLNVHFH MLFLDGVYVE QSHGSARFRW VKAPTSPELT QLTHTIAHRV GRYLERQGLL
ERDVENSYLA SDAVDDDPMT PLLGHSITYR IAVGSQAGRK VFTLQTLPTS GDPFGDGIGK
VAGSSLHAGV AARADERKKL ERLCRYISRP AVSEKRLSLT RGGNVRYQLK TPYRDGTTHV
IFEPLDFIAR LAALVPKPRV NLTRFHGVFA PNSRHRALVT PAKRGRGNKV RVADEPATPA
QRRASMTWAQ RLKRVFNIDI ETCSGCGGAM KVIACIEDPI VIKQILDHLK HKAETSGTRA
LPESRAPPAE LLLGLFD