Gene SNSL254_A3858 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A3858 
Symbol 
ID6486009 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp3731624 
End bp3732820 
Gene Length1197 bp 
Protein Length398 aa 
Translation table11 
GC content57% 
IMG OID642739123 
Producthypothetical protein 
Protein accessionYP_002042834 
Protein GI194446673 
COG category[R] General function prediction only 
COG ID[COG2081] Predicted flavoproteins 
TIGRFAM ID[TIGR00275] flavoprotein, HI0933 family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00469108 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones92 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGAAAGGT TTGATGCCGT TATTATAGGC GCTGGCGCAG CGGGCATGTT TTGCGCCGCG 
CAGGCAGGAC AGGCGGGTAG CCGCGTGCTG CTCATCGATA ATGGCAAGAA GCCAGGACGT
AAAATCCTCA TGTCCGGCGG TGGGCGCTGC AACTTTACTA ATCTTTATGT TGAGCCTGCT
GCGTATTTGA GCCAGAACCC CCATTTTTGT AAATCAGCGT TAGCCCGCTA TACCCAGTGG
GATTTTATCG ATCTGGTCGA CAGGTATGGG ATAGCCTGGC ATGAGAAAAC GCTGGGACAG
CTTTTTTGCG ATGATTCCGC CCAACGCATT GTCGATATGC TGGTTGCCGA GTGCGACAAA
GGCGGCGTAA CGATGCGCCT GCGTAGCGAG GTACTGAGCG TCGAGCGTGA TGAGTCGGGT
TTCATACTGG CGTTGAACGG CGAGACGGTG ACTACGCAAA AGCTGGTGAT TGCCAGCGGC
GGCCTGTCGA TGCCGGGGCT TGGCGCATCG CCGTTTGGCT ATAAAATCGC CGAACAGTTT
GGTCTCAAGG TGTTGCCGAC GCGCGCCGGG CTGGTGCCCT TTACGCTGCA TAAGCCGCTG
TTAGAACAGC TCCAGACGCT GTCTGGCGTC TCTGTGCCCT GCGTGATTAC CGCCCGCAAT
GGTACGGTAT TTCGGGAAAA CCTACTTTTT ACCCATCGTG GGCTGTCCGG CCCCGCCGTT
TTACAGATTT CCAGCTACTG GCAACCGGGC GAGTTAGTGA GCATTAACTT ATTGCCGGAT
CTCTCGCTGG AAGACGTTCT CAATGAACAG CGTAACGCGC ACCCGAACCA GAGTCTGAAG
AACACGCTGG CGATGCATCT GCCGAAACGG TTGGTGGAGT GTTTACAACA GTTGGGGCAG
ATCCCGGATG TATCGCTCAG GCAGTTGAAC GTTCGTGACC AGCAGGCGTT GGTTGACACG
CTTACGGCCT GGCAAGTGCA GCCTAACGGC ACCGAAGGCT ATCGGACAGC GGAAGTGACG
CTGGGCGGCG TGGATACAAA CGAACTATCA TCGCGGACTA TGGAAGCGCG CCGCGTGCCG
GGTCTCTATT TTATTGGCGA AGTGATGGAC GTCACCGGCT GGTTGGGCGG CTATAACTTC
CAGTGGGCCT GGTCGAGCGC CTGGGCCTGC GCGCAGGATT TGGCGGCAAA ACGCTAA
 
Protein sequence
MERFDAVIIG AGAAGMFCAA QAGQAGSRVL LIDNGKKPGR KILMSGGGRC NFTNLYVEPA 
AYLSQNPHFC KSALARYTQW DFIDLVDRYG IAWHEKTLGQ LFCDDSAQRI VDMLVAECDK
GGVTMRLRSE VLSVERDESG FILALNGETV TTQKLVIASG GLSMPGLGAS PFGYKIAEQF
GLKVLPTRAG LVPFTLHKPL LEQLQTLSGV SVPCVITARN GTVFRENLLF THRGLSGPAV
LQISSYWQPG ELVSINLLPD LSLEDVLNEQ RNAHPNQSLK NTLAMHLPKR LVECLQQLGQ
IPDVSLRQLN VRDQQALVDT LTAWQVQPNG TEGYRTAEVT LGGVDTNELS SRTMEARRVP
GLYFIGEVMD VTGWLGGYNF QWAWSSAWAC AQDLAAKR