Gene SNSL254_A4836 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A4836 
Symbol 
ID6487007 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp4709793 
End bp4711058 
Gene Length1266 bp 
Protein Length421 aa 
Translation table11 
GC content48% 
IMG OID642740049 
Productintegrase 
Protein accessionYP_002043726 
Protein GI194444728 
COG category[L] Replication, recombination and repair 
COG ID[COG0582] Integrase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.466297 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones64 
Fosmid unclonability p-value0.712135 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCTCA ACGCCAGACA GGTCGAGACC GCAAAGCCAA AAGACAAAAC CTACAAAATG 
GCCGATGGCG GCGGTTTGTA TCTTGAGGTT TCGGCCAAGG GTTCTAAATA CTGGCGCATG
AAATACAGAC GCCCCTCTGA CAAAAAAGAG GATCGCCTTG CCTTTGGTGT TTGGCCTACG
GTGACGCTTG CTCAGGCAAG AGCAAAGCGT GACGAGGCTA AAAAGCTCTT AGTACAGGGC
ATTGACCCAA AAGCCGAACA GAAAGAAGCT CAGGCTGAGA ATTCGGGGGC ATACACTTTC
GAAACAATTG CTCGTGAATG GCATGCCAGT AACAAGCGCT GGAGTGAAGA CCATCGATCA
CGCGTTCTGC GATATCTTGA GCTTTATATC TTCCCTCATA TCGGTTCGTC CGACATTCGG
CAGCTTAAAA CCAGCCACCT GTTAGCCCCG ATTAAAAAAG TTGATGCCAG TGGTAAGCAT
GACGTTGCGC AGCGTCTTCA ACAACGTGTC ACGGCTGTAA TGCGTTATGC CGTTCAGAAC
GATTACATCG ACTCTAATCC GGCCAGTGAT ATGGCCGGTG CGCTATCGAC AACCAAAGCG
CGACATTACC CCGCTTTACC CTCAAGTCGA TTCCCTGAAT TTCTTGCACG TCTTGCTGCA
TATCGTGGCC GTGTAATGAC ACGGATCGCG GTTGAGCTTT CCTTACTAAC TTTTGTACGT
TCCAGTGAAT TACGTTTCGC GCGTTGGGAT GAGTTCGACT TCGATAAGTC TCTCTGGCGT
ATACCTGCAA AACGAGAAGA AATTAAAGGC GTGCGGTACT CATACCGTGG CATGAAGATG
AAAGAGGAGC ATATCGTTCC GCTTAGTCTG CAGGCGATGG CTTTGTTAGA GCAGCTTAAG
CAGATGAGTG GTGATAAAGA GCTGCTTTTT CCGGGCGATC ATGACCCAAC CAAGGTTATG
AGTGAAAACA CGGTAAATAG CGCATTACGT GCGATGGGCT ATGACACTAA AACAGATGTC
TGCGGACATG GGTTTAGGAC GATGGCGCGT GGTGCGTTGG GAGAGTCAGG ATTATGGAGC
GATGATGCGA TAGAGCGCCA GCTAAGCCAC TCGGAACGTA ACAATGTACG TGCAGCTTAT
ATTCATACTT CTGAACATTT GGATGAGCGC CGGTTGATGG TCCAGTGGTG GGCTGACTAT
CTTGATGAGA TAAAAATCAT TTACATAACA CCATATGATT TCGCTAAAAT GAATAATCGC
AATTAA
 
Protein sequence
MKLNARQVET AKPKDKTYKM ADGGGLYLEV SAKGSKYWRM KYRRPSDKKE DRLAFGVWPT 
VTLAQARAKR DEAKKLLVQG IDPKAEQKEA QAENSGAYTF ETIAREWHAS NKRWSEDHRS
RVLRYLELYI FPHIGSSDIR QLKTSHLLAP IKKVDASGKH DVAQRLQQRV TAVMRYAVQN
DYIDSNPASD MAGALSTTKA RHYPALPSSR FPEFLARLAA YRGRVMTRIA VELSLLTFVR
SSELRFARWD EFDFDKSLWR IPAKREEIKG VRYSYRGMKM KEEHIVPLSL QAMALLEQLK
QMSGDKELLF PGDHDPTKVM SENTVNSALR AMGYDTKTDV CGHGFRTMAR GALGESGLWS
DDAIERQLSH SERNNVRAAY IHTSEHLDER RLMVQWWADY LDEIKIIYIT PYDFAKMNNR
N