Gene SeHA_C3901 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeHA_C3901 
Symbol 
ID6488297 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Heidelberg str. SL476 
KingdomBacteria 
Replicon accessionNC_011083 
Strand
Start bp3770183 
End bp3771379 
Gene Length1197 bp 
Protein Length398 aa 
Translation table11 
GC content57% 
IMG OID642744008 
Producthypothetical protein 
Protein accessionYP_002047614 
Protein GI194449314 
COG category[R] General function prediction only 
COG ID[COG2081] Predicted flavoproteins 
TIGRFAM ID[TIGR00275] flavoprotein, HI0933 family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.150346 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones98 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGAAAGGT TTGATGCCGT TATTATAGGC GCTGGCGCAG CGGGCATGTT TTGCGCCGCG 
CAGGCAGGAC AGGCGGGTAG CCGCGTGCTG CTCATCGATA ATGGCAAGAA GCCAGGACGT
AAAATCCTCA TGTCTGGCGG CGGGCGCTGC AACTTTACTA ATCTTTATGT TGAGCCTGCC
GCGTATTTGA GCCAGAACCC CCATTTTTGC AAATCAGCGT TAGCCCGCTA TACCCAGTGG
GATTTTATCG ATCTGGTCGG CAGGTATGGG ATAGCCTGGC ATGAGAAAAC GCTGGGACAG
CTTTTTTGCG ATGATTCCGC CCAACGCATT GTCGATATGC TGGTTGCCGA GTGCGACAAA
GGCGGCGTAA CGATGCGCCT GCGTAGCGAG GTATTGAGCG TCGAGCGTGA TGAGTCGGGT
TTCGTACTGG CGTTGAACGG CGAGACGGTG ACTACGCAAA AGCTGGTGAT TGCCAGCGGC
GGCCTGTCGA TGCCGGGGCT TGGCGCATCA CCGTTTGGCT ATAAAATCGC CGAACAGTTT
GGTCTCAAGG TGTTGCCGAC GCGCGCCGGG CTGGTGCCCT TTACGCTACA TAAGCCGCTG
TTAGAACAGC TCCAGACGCT GTCTGGCGTC TCTGTGCCCT GCGTGATTAC CGCTCGCAAT
GGCACGGTAT TTCGGGAAAA CCTGCTTTTT ACCCATCGTG GGCTGTCCGG CCCCGCCGTT
TTACAGATTT CCAGCTACTG GCAACCGGGC GAGTTAGTGA GCATTAACTT ATTGCCGGAC
CTCTCGCTGG AAGATGTTCT CAATGAACAG CGTAACGCGC ACCCGAACCA GAGTCTGAAG
AACACGCTGG CGATGCATCT GCCGAAACGG TTGGTGGAGT GTTTACAACA GTTGGGGCAC
ATCCCGGATG TATCGCTCAG ACAGTTGAAC GTTCGTGACC AGCAGGCGTT GGTTGACACG
CTTACGGCCT GGCAAGTGCA GCCTAACGGC ACCGAAGGCT ATCGGACAGC GGAAGTGACG
CTGGGCGGCG TGGATACAAA CGAACTATCA TCGCGGACTA TGGAAGCGCG CCGCGTGCCG
GGTCTCTATT TTATCGGCGA AGTGATGGAC GTCACCGGCT GGTTGGGCGG CTATAACTTC
CAGTGGGCGT GGTCGAGCGC CTGGGCCTGC GCGCAGGATT TGGCGGCAAA ACGCTAA
 
Protein sequence
MERFDAVIIG AGAAGMFCAA QAGQAGSRVL LIDNGKKPGR KILMSGGGRC NFTNLYVEPA 
AYLSQNPHFC KSALARYTQW DFIDLVGRYG IAWHEKTLGQ LFCDDSAQRI VDMLVAECDK
GGVTMRLRSE VLSVERDESG FVLALNGETV TTQKLVIASG GLSMPGLGAS PFGYKIAEQF
GLKVLPTRAG LVPFTLHKPL LEQLQTLSGV SVPCVITARN GTVFRENLLF THRGLSGPAV
LQISSYWQPG ELVSINLLPD LSLEDVLNEQ RNAHPNQSLK NTLAMHLPKR LVECLQQLGH
IPDVSLRQLN VRDQQALVDT LTAWQVQPNG TEGYRTAEVT LGGVDTNELS SRTMEARRVP
GLYFIGEVMD VTGWLGGYNF QWAWSSAWAC AQDLAAKR