Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeD_A2389 |
Symbol | |
ID | 6874730 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 |
Kingdom | Bacteria |
Replicon accession | NC_011205 |
Strand | + |
Start bp | 2259159 |
End bp | 2260514 |
Gene Length | 1356 bp |
Protein Length | 451 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 642785480 |
Product | polyhedral body protein |
Protein accession | YP_002216138 |
Protein GI | 198245042 |
COG category | [C] Energy production and conversion |
COG ID | [COG4656] Predicted NADH:ubiquinone oxidoreductase, subunit RnfC |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 73 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGCCG CCATCAATAG CGTTGAAATG TCCCATAGCG CCGATGAAAT TCGCGAGCGC GTCCGCGCAG CGGGCGTTGT CGGCGCGGGC GGCGCAGGTT TTCCCGCTCA CGTCAAACTA CAGGCGCAGG TAGAGATTTT TCTGGTGAAC GCCGCCGAAT GTGAACCGAT GCTCAAAGTT GACCAGCAAC TGATGTGGCA GCAGGCCGCG CGTCTTGTTC GCGGCGTGCA GTACGCGATG ACGGCAACCG GCGCGCGCGA AGGGGTGATA GCCCTGAAAG AAAAATACCG CCGGGCCATC GACGCCCTCA CTCCACTGCT GCCAGACGGT ATCCGCCTGC ATATCCTGCC GGATGTATAT CCCGCGGGCG ATGAGGTGCT GACTATCTGG ATGGCAACCG GTCGTCGGGT CGCCCCGGCT GCGCTGCCTG CCAGCGTCGG CGTGGTCGTC AATAACGTGC AAACGGTACT CAATATTGCC CGGGCGGTCG AGCAGCAGTT TCCGGTCACT CGTCGCACGT TGACCGTGAA CGGCGCGGTC GTCAGACCGT TGACCGTTAC CGTTCCTGTC GGCATGTCAC TGCATGAAGT GCTGGCGCTG GCGGGCGGCG CAACGGTCGA CGACCCTGGT TTTATTAACG GCGGCCCGAT GATGGGTGGC CTGATTACCT CTCTTGATAA CCCGGTGACG AAAACTACCG GCGGCCTGCT GGTGCTCCCA AAAAGCCATC CGCTTATCCA GCGGAGAATG CAGGACGAGC GCACGGTGCT TTCCGTCGCG CGCACAGTCT GCGAACAGTG CCGACTGTGT ACCGATCTCT GCCCGCGACA CCTGATCGGC CACGAACTCT CTCCGCACTT GCTGGTGCGG GCGGTGAACT TTCACCAGGC TGCCACGCCA CAGCTGCTGC TGAGCGCCCT TACCTGCTCG GAATGCAACG TTTGCGAAAG CGTGGCCTGT CCGGTGGGGA TCTCGCCGAT GCGCATCAAC CGCATGTTAA AACGCGAGCT GCGGGCGCAA AACCAGCGCT ACAAAGGGCC GCTGAATCCG TCCGACGAAA TGGCGAAATA TCGCCTGGTG CCGGTGAAGC GGCTGATCGC CAAACTGGGA CTAAGCCCCT GGTACCAGGA AGCGCCGCTG GTTGAAGACG AACCGGCTGT AGGAACAGTG ACTTTGCAAC TGCGCCAGCA TATTGGTGCC AGCGCGGTAG CGAACGTTGC GGTGGGAGAA CGCGTGACGC GCGGGCAATG CGTTGCTGAT GTCCCGCCTG GCGCGCTCGG CGCACCCATT CACGCCAGCA TCGACGGCGT TGTATCGGCC ATCAGCGAAC AGGCCATCAC GGTTGTAAGA GGTTAA
|
Protein sequence | MSAAINSVEM SHSADEIRER VRAAGVVGAG GAGFPAHVKL QAQVEIFLVN AAECEPMLKV DQQLMWQQAA RLVRGVQYAM TATGAREGVI ALKEKYRRAI DALTPLLPDG IRLHILPDVY PAGDEVLTIW MATGRRVAPA ALPASVGVVV NNVQTVLNIA RAVEQQFPVT RRTLTVNGAV VRPLTVTVPV GMSLHEVLAL AGGATVDDPG FINGGPMMGG LITSLDNPVT KTTGGLLVLP KSHPLIQRRM QDERTVLSVA RTVCEQCRLC TDLCPRHLIG HELSPHLLVR AVNFHQAATP QLLLSALTCS ECNVCESVAC PVGISPMRIN RMLKRELRAQ NQRYKGPLNP SDEMAKYRLV PVKRLIAKLG LSPWYQEAPL VEDEPAVGTV TLQLRQHIGA SAVANVAVGE RVTRGQCVAD VPPGALGAPI HASIDGVVSA ISEQAITVVR G
|
| |