Gene SNSL254_A4321 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A4321 
Symbol 
ID6484654 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp4210649 
End bp4211749 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content45% 
IMG OID642739565 
Producthypothetical protein 
Protein accessionYP_002043259 
Protein GI194443990 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.932757 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones59 
Fosmid unclonability p-value0.276393 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAGGA ATTTATTATC CTCAGCGATA ATCATTGCGC TAATGACCCT GGGCGCAACA 
GGATGTGATG ACAATAATGT TAAAACCGAG GCGACGCCGG CCGCCAGCAG TCAGCCTGCG
ACGCCAGCGC CTTCTCAGAC GCCGGAAACG CAATCTGACG AAAGTCCAGC GCAGCCCTCA
GCAGCGAAGC CAGAAACGGC AACTCAGCCC CCGGTGGCGA AACCAGAAAC GCCAGCTCAG
CCGGAGGTTG ACGCTGAAGA AGTTTATAGT GAAAAAATGG ATGTCTATAT CGATTGTTTT
AATAAACTTC AATTGCCCGT TCAGCACAGT CTGGCGCGTT ACGCGGATTG GGTGAAAGAC
TTTAAAAAAG GTCCGACAGG GAAAGAGAGC CTGGTTTATG GCATTTATGG TATTACGGAG
TCTTACATAA CGAATTGCCA GAAAGAGATG AAACAGGTGG CCGCCTTAAC GCCATTACTT
GAGCCTATTG ATGGCGTTGC CGTTAGCTAT ATTGATAGCG CCGCTGCGCT GGGTAATACC
ATTAACGAAA TGGAAAAATA TTATACCCAG GAAAACTATA AAGATGATGC CTTTGCTAAA
GGTAAGGCGC TGCATCAGAC ATTACTGAAG AATATCGAGG ATTTTAAACC CGTCTCGGAA
AAATATCATG AGGCTATTCA GGAAATAAAT GACAGGCGGC AATTGACACA GTTGAAGAGA
ATAGAAGAAG CGGAAGGCAA AACATTTAAC TATTATTCTC TGGCTGTCAT GATTTCGGCA
AAGCAGATCA ACAAGGTTAT TTCTGCCGAT ACCTTTGATG CCGAAGCGAT GATGAAAAAA
GTCGCGGAAC TGGAAACAAT GATTGCGCAA TTGAAAGAAG TGAATACTGA TGGCCGTAAT
TCTTCTTTCA TCAGCTCTGC GGCTGATTAT CAGCTACAAG CTAAAAAATA TATTCGTCGC
ATCAGAGACA ATGTTGAGTA TTCTGATTTT GAAAAGAAAC GGGTGCAGGA CCCTGCAACA
GGATGGATGG TTGCGGATTC TTATCCGGCG TCGTTGAGAA GTTATAACGA GATGGTGGAT
GATTATAACC GCCTGCGTTG A
 
Protein sequence
MKRNLLSSAI IIALMTLGAT GCDDNNVKTE ATPAASSQPA TPAPSQTPET QSDESPAQPS 
AAKPETATQP PVAKPETPAQ PEVDAEEVYS EKMDVYIDCF NKLQLPVQHS LARYADWVKD
FKKGPTGKES LVYGIYGITE SYITNCQKEM KQVAALTPLL EPIDGVAVSY IDSAAALGNT
INEMEKYYTQ ENYKDDAFAK GKALHQTLLK NIEDFKPVSE KYHEAIQEIN DRRQLTQLKR
IEEAEGKTFN YYSLAVMISA KQINKVISAD TFDAEAMMKK VAELETMIAQ LKEVNTDGRN
SSFISSAADY QLQAKKYIRR IRDNVEYSDF EKKRVQDPAT GWMVADSYPA SLRSYNEMVD
DYNRLR