Gene SNSL254_A1129 details

Gene Information

Locus tag: SNSL254_A1129
Symbol:
ID: 6486973
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Newport str. SL254
Kingdom: Bacteria
Replicon accession: NC_011080
Strand:
Start bp: 1135987
End bp: 1137006
Gene Length: 1020 bp
Protein Length: 339 aa
Translation table: 11
GC content: 48%
IMG OID: 642736531
Product: site-specific recombinase, phage integrase family
Protein accession: YP_002040289
Protein GI: 194444962
COG category:
COG ID:
TIGRFAM ID:
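
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the stop codon; neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1135987
end_bp = 1137006
gene_length_bp = 1020
protein_length_aa = 339

# Assumption: coordinates are 1-based and inclusive, so the span
# covered by the gene is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the protein length excludes the stop codon, so the CDS
# needs one codon per residue plus one terminating codon.
expected_cds_bp = (protein_length_aa + 1) * 3

print("span matches gene length:", span_bp == gene_length_bp)
print("CDS length matches protein length:", expected_cds_bp == gene_length_bp)
```

Under these assumptions both checks agree with the listed gene length of 1020 bp.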


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal


Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 5.3601e-24
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker


Sequence

Gene sequence
ATGGGGCGCA GAAGAAAAGA TCTGGGCGAT GTCAAGCTCC CCCCACGCGT ATCAAAAACC 
AGAACCCGTT ACTACTACAA ACCCACGTCG CGGGAAACTG TGACACTGGG GCCAATCACT
CTCACTATGT CGGCATTATG GAAACGGTAC GAGGAAGAAC GGCGCAATTA CTCAGATGTA
ATGACGTTCG AAAAGCTCTG GAAGATGTTT CTTAAAAGCG CCTACTACAC CGAGCTTGCA
ATACGAACCC AGCGGGATTA TTTGCAACAT CAGAAAAAAT TGCTTGCCGT GTTTGGTAAA
GTTAAAGCTG ATGTAATAAA GCCAGAAGAT GTGCGTCAGT TTATGGATCG TCGTGGACTG
CAAAGTAAAA ACCAGGCCAA CCAGGAAATG AGCAGCATGT CACGTGTTTA CCGCTGGGGG
TATGAACGCG GTTACGTTAA GGGAAATCCG TGTGCCGGCG TCAGTAAATT CTCTCTCAAG
GCTCGCGAGC AATACATCAC TGACGAAGAC TACCTGGCTA TTTATAAGCA TGCTGATCAC
GTTGTCAGGG CTGCAATGGA AATTTCTTAC CTGTGCGCCG CCAGGCAAGC TGACGTACTC
GCTCTGCGCT GGATGCAAAT TTCTGATAAG GGGATTTTTA TCCAGCAAGG AAAGACCGGA
AAAAAACAGA TTAAGGTCTG GACTCCCCGC CTTCAGCAAG CGCTGAAAAC AGCACAGACA
GAATGTCCAA AACTGTCACC TGACGCGCTG GTTCTCTACA ACAACGATCG TGGTCAGTTC
ATCCGCAAGA CGTTCAATAA TCGCTGGTTA AAAGCTGTAC GCGCCGCACA AAGTGAACTG
GGCCGACAAC TGGATTACAC ATTCCACGAT ATCAAGGCAA AAGCTATTTC AGATTTTGAG
GGTAGTAGCA GGGATAAGCA GATTTTCAGC GGCCACAAAA CAGAAAGCCA GGTGCTTATC
TACGACAGGA AGGTACAAAT CAGCCCGACA CTGGATCGTC CGGTTATTGG GGAAAAGTGA
 
Protein sequence
MGRRRKDLGD VKLPPRVSKT RTRYYYKPTS RETVTLGPIT LTMSALWKRY EEERRNYSDV 
MTFEKLWKMF LKSAYYTELA IRTQRDYLQH QKKLLAVFGK VKADVIKPED VRQFMDRRGL
QSKNQANQEM SSMSRVYRWG YERGYVKGNP CAGVSKFSLK AREQYITDED YLAIYKHADH
VVRAAMEISY LCAARQADVL ALRWMQISDK GIFIQQGKTG KKQIKVWTPR LQQALKTAQT
ECPKLSPDAL VLYNNDRGQF IRKTFNNRWL KAVRAAQSEL GRQLDYTFHD IKAKAISDFE
GSSRDKQIFS GHKTESQVLI YDRKVQISPT LDRPVIGEK
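
The relationship between the gene sequence and the protein sequence above can be reproduced with a short script. This is a minimal sketch, not the database's own pipeline: the function names `clean`, `translate`, and `gc_fraction` are hypothetical helpers, and the codon assignments are those of the standard genetic code, which translation table 11 shares for internal codons (table 11 differs in which codons may act as initiators; this gene starts with ATG, so that difference does not matter here).

```python
from itertools import product

# Codon assignments of the standard genetic code in TCAG order;
# NCBI translation table 11 uses the same amino acid assignments.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Drop the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence listed above.
first_line = ("ATGGGGCGCA GAAGAAAAGA TCTGGGCGAT "
              "GTCAAGCTCC CCCCACGCGT ATCAAAAACC")
print(translate(first_line))  # MGRRRKDLGDVKLPPRVSKT
```

Passing the full gene sequence to `translate` and `gc_fraction` in the same way allows a comparison with the listed protein sequence and GC content.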