Gene SeSA_A3781 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeSA_A3781 
Symbol 
ID6517911 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Schwarzengrund str. CVM19633 
KingdomBacteria 
Replicon accessionNC_011094 
Strand
Start bp3641037 
End bp3642233 
Gene Length1197 bp 
Protein Length398 aa 
Translation table11 
GC content57% 
IMG OID642748760 
Producthypothetical protein 
Protein accessionYP_002116524 
Protein GI194738325 
COG category[R] General function prediction only 
COG ID[COG2081] Predicted flavoproteins 
TIGRFAM ID[TIGR00275] flavoprotein, HI0933 family 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGAAAGGT TTGATGCCGT TATTATAGGC GCTGGCGCAG CGGGCATGTT TTGCGCCGCG 
CAGGCAGGAC AGGCGGGTAG CCGCGTGCTG CTCGTCGATA ATGGCAAGAA GCCAGGACGT
AAAATCCTCA TGTCCGGCGG TGGCCGCTGC AACTTTACTA ATCTTTATGT TGAGCCTGCT
GCGTATTTGA GCCAGAACCC CCATTTTTGC AAATCAGCAT TAGCCCGCTA TACCCAGTGG
GACTTTATCG ATCTGGTCGG CAGGTATGGG ATAGCCTGGC ATGAGAAAAC GCTGGGACAG
CTTTTTTGCG ATGATTCCGC CCAACGCATT GTCGATATGC TGGTTGCCGA GTGCGACAAA
GGCGGCGTAA CGATGCGCCT GCGTAGCGAG GTACTGAGCG TCGAGCGTGA TGAGTCGGGT
TTCATACTGG CGCTGAACGG CGAGACGGTG ACTACGCAAA AGCTGGTGAT TGCCAGCGGC
GGCCTGTCGA TGCCGGGGCT TGGCGCATCG CCGTTTGGCT ATAAAATCGC CGAACAGTTT
GGTCTCAAGG TGTTGCCGAC GCGCGCCGGG CTGGTGCCCT TTACGCTGCA TAAGCCGCTG
TTAGAACAGC TCCAGACGCT GTCTGGCGTC TCTGTGCCCT GCGTGATTAC CGCCCGCAAT
GGCACGGTAT TTCGGGAAAA CCTACTTTTT ACCCATCGTG GGCTGTCCGG CCCCGCTGTT
TTACAGATTT CCAGCTACTG GCAACCGGGC GAGTTAGTGA GCATTAACTT ATTGCCGGAC
CTCTCGCTGG AAGACGTTCT CAATGAACAG CGTAACGCGC ACCCGAACCA GAGTCTGAAG
AACACGCTGG CGATGCATTT GCCGAAACGG TTGGTGGAGT GTTTACAACA GTTGGGGCAC
ATCCCGGATG TATCGCTCAG GCAGTTGAAC GTTCGTGACC AGCAGGCGTT AGTTGACACG
CTTACGGCCT GGCAAGTGCA GCCTAACGGC ACCGAAGGCT ATCGGACAGC GGAAGTGACG
CTGGGCGGCG TGGATACAAA CGAACTATCA TCGCGGACTA TGGAAGCGCG CCGCGTGCCG
GGTCTCTATT TTATCGGCGA AGTGATGGAC GTCACCGGCT GGTTGGGCGG CTATAACTTC
CAGTGGGCGT GGTCGAGCGC CTGGGCCTGC GCGCAGGATT TGGCGGCAAA ACGCTAA
 
Protein sequence
MERFDAVIIG AGAAGMFCAA QAGQAGSRVL LVDNGKKPGR KILMSGGGRC NFTNLYVEPA 
AYLSQNPHFC KSALARYTQW DFIDLVGRYG IAWHEKTLGQ LFCDDSAQRI VDMLVAECDK
GGVTMRLRSE VLSVERDESG FILALNGETV TTQKLVIASG GLSMPGLGAS PFGYKIAEQF
GLKVLPTRAG LVPFTLHKPL LEQLQTLSGV SVPCVITARN GTVFRENLLF THRGLSGPAV
LQISSYWQPG ELVSINLLPD LSLEDVLNEQ RNAHPNQSLK NTLAMHLPKR LVECLQQLGH
IPDVSLRQLN VRDQQALVDT LTAWQVQPNG TEGYRTAEVT LGGVDTNELS SRTMEARRVP
GLYFIGEVMD VTGWLGGYNF QWAWSSAWAC AQDLAAKR