Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeHA_C3901 |
Symbol | |
ID | 6488297 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Heidelberg str. SL476 |
Kingdom | Bacteria |
Replicon accession | NC_011083 |
Strand | - |
Start bp | 3770183 |
End bp | 3771379 |
Gene Length | 1197 bp |
Protein Length | 398 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 642744008 |
Product | hypothetical protein |
Protein accession | YP_002047614 |
Protein GI | 194449314 |
COG category | [R] General function prediction only |
COG ID | [COG2081] Predicted flavoproteins |
TIGRFAM ID | [TIGR00275] flavoprotein, HI0933 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.150346 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 98 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGAAAGGT TTGATGCCGT TATTATAGGC GCTGGCGCAG CGGGCATGTT TTGCGCCGCG CAGGCAGGAC AGGCGGGTAG CCGCGTGCTG CTCATCGATA ATGGCAAGAA GCCAGGACGT AAAATCCTCA TGTCTGGCGG CGGGCGCTGC AACTTTACTA ATCTTTATGT TGAGCCTGCC GCGTATTTGA GCCAGAACCC CCATTTTTGC AAATCAGCGT TAGCCCGCTA TACCCAGTGG GATTTTATCG ATCTGGTCGG CAGGTATGGG ATAGCCTGGC ATGAGAAAAC GCTGGGACAG CTTTTTTGCG ATGATTCCGC CCAACGCATT GTCGATATGC TGGTTGCCGA GTGCGACAAA GGCGGCGTAA CGATGCGCCT GCGTAGCGAG GTATTGAGCG TCGAGCGTGA TGAGTCGGGT TTCGTACTGG CGTTGAACGG CGAGACGGTG ACTACGCAAA AGCTGGTGAT TGCCAGCGGC GGCCTGTCGA TGCCGGGGCT TGGCGCATCA CCGTTTGGCT ATAAAATCGC CGAACAGTTT GGTCTCAAGG TGTTGCCGAC GCGCGCCGGG CTGGTGCCCT TTACGCTACA TAAGCCGCTG TTAGAACAGC TCCAGACGCT GTCTGGCGTC TCTGTGCCCT GCGTGATTAC CGCTCGCAAT GGCACGGTAT TTCGGGAAAA CCTGCTTTTT ACCCATCGTG GGCTGTCCGG CCCCGCCGTT TTACAGATTT CCAGCTACTG GCAACCGGGC GAGTTAGTGA GCATTAACTT ATTGCCGGAC CTCTCGCTGG AAGATGTTCT CAATGAACAG CGTAACGCGC ACCCGAACCA GAGTCTGAAG AACACGCTGG CGATGCATCT GCCGAAACGG TTGGTGGAGT GTTTACAACA GTTGGGGCAC ATCCCGGATG TATCGCTCAG ACAGTTGAAC GTTCGTGACC AGCAGGCGTT GGTTGACACG CTTACGGCCT GGCAAGTGCA GCCTAACGGC ACCGAAGGCT ATCGGACAGC GGAAGTGACG CTGGGCGGCG TGGATACAAA CGAACTATCA TCGCGGACTA TGGAAGCGCG CCGCGTGCCG GGTCTCTATT TTATCGGCGA AGTGATGGAC GTCACCGGCT GGTTGGGCGG CTATAACTTC CAGTGGGCGT GGTCGAGCGC CTGGGCCTGC GCGCAGGATT TGGCGGCAAA ACGCTAA
|
Protein sequence | MERFDAVIIG AGAAGMFCAA QAGQAGSRVL LIDNGKKPGR KILMSGGGRC NFTNLYVEPA AYLSQNPHFC KSALARYTQW DFIDLVGRYG IAWHEKTLGQ LFCDDSAQRI VDMLVAECDK GGVTMRLRSE VLSVERDESG FVLALNGETV TTQKLVIASG GLSMPGLGAS PFGYKIAEQF GLKVLPTRAG LVPFTLHKPL LEQLQTLSGV SVPCVITARN GTVFRENLLF THRGLSGPAV LQISSYWQPG ELVSINLLPD LSLEDVLNEQ RNAHPNQSLK NTLAMHLPKR LVECLQQLGH IPDVSLRQLN VRDQQALVDT LTAWQVQPNG TEGYRTAEVT LGGVDTNELS SRTMEARRVP GLYFIGEVMD VTGWLGGYNF QWAWSSAWAC AQDLAAKR
|
| |