Gene Information |
Locus tag | SNSL254_A2230 |
Symbol | |
ID | 6486850 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | + |
Start bp | 2137417 |
End bp | 2138772 |
Gene Length | 1356 bp |
Protein Length | 451 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 642737578 |
Product | polyhedral body protein |
Protein accession | YP_002041320 |
Protein GI | 194442756 |
COG category | [C] Energy production and conversion |
COG ID | [COG4656] Predicted NADH:ubiquinone oxidoreductase, subunit RnfC |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0520726 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 81 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGCCG CCATCAATAG CGTTGAAATG TCCCATAGCG CCGATGAAAT TCGCGAGCGC GTCCGCGCAG CGGGCGTTGT CGGCGCGGGC GGCGCAGGTT TTCCCGCTCA CGTCAAACTA CAGGCGCAGG TAGAGATTTT TCTGGTGAAC GCCGCCGAGT GTGAACCGAT GCTCAAAGTT GACCAGCAAC TGATGTGGCA GCAAACTGCG CGCCTTGTGC GCGGCGTACA GTACGCCATG ACGGCAACCG GCGCGCGCGA AGGGGTGATA GCCCTGAAAG AAAAATACCG CCGGGCCATC GACGCCCTCA CCCCACTGCT GCCAGACGGT ATCCGCCTGC ATATCCTGCC GGATGTTTAT CCTGCGGGCG ATGAGGTGCT AACTATCTGG ATGGCAACCG GCCGCCGGGT CGCCCCGGCT GCGCTACCGG CCAGCGTCGG CGTGGTCGTC AATAACGTGC AAACGGTGCT GAATATTGCC CGGGCGATCG AGCAGCAGTT TCCGGTCACT CGTCGCACGT TGACCGTGAA CGGCGCGGTC GCCAGACCGT TGACCGTTAC CGTTCCTATC GGCATGTCAC TGCATGAAGT GCTGGCGCTG GCGGGCGGCG CAACGGTCGA CGACCCTGGT TTTATTAACG GCGGCCCGAT GATGGGTGGT CTGATTACCT CTCTTGATAA CCCGGTGACG AAAACTACCG GCGGCCTGCT GGTGCTCCCA AAAAGCCATC CGCTTATCCA GCGCAGGATG CAGGACGAGC GCACAGTGCT TTCCGTCGCG CGCACCGTCT GCGAACAGTG CCGACTCTGT ACCGATCTCT GCCCACGACA TCTGATCGGC CACGAACTCT CTCCGCACCT GCTGGTGCGG GCGGTGAACT TTCATCAGGC TGCCACGCCG CAACTGCTGC TGAGCGCCCT TACCTGCTCG GAATGCAACA TTTGCGAAAG CGTGGCCTGC CCGGTCGGGA TCTCGCCGAT GCGCATCAAC CGCATGTTAA AACGCGAGCT GCGGGCGCAA AACCAGCGCT ACGAAGGGCC GCTGTATCCA GCCGACGAGA TGGCGAAATA TCGCCTCGTG CCAGTGAAGC GGCTGATCGC CAAACTGGGA CTAAGCCCCT GGTACCAGGA AGCGCCGCTG GTTGAAGAAG AGCCGTCAGT AGAAAAAGTC ACTTTGCAGC TGCGCCAGCA TATTGGTGCC AGCGCAGTAC CGACTGTTGC CGTCGGCGAG CGCGTGACGC GCGGGCAATG CGTTGCCGAT GTCCCGCCTG GCGCGCTCGG CGCATCCATT CACGCCAGCA TCGACGGCGT TGTATCGGCC ATCAGCGAAC AGGCCATCAC GGTTGTAAGA GGTTAA
|
Protein sequence | MSAAINSVEM SHSADEIRER VRAAGVVGAG GAGFPAHVKL QAQVEIFLVN AAECEPMLKV DQQLMWQQTA RLVRGVQYAM TATGAREGVI ALKEKYRRAI DALTPLLPDG IRLHILPDVY PAGDEVLTIW MATGRRVAPA ALPASVGVVV NNVQTVLNIA RAIEQQFPVT RRTLTVNGAV ARPLTVTVPI GMSLHEVLAL AGGATVDDPG FINGGPMMGG LITSLDNPVT KTTGGLLVLP KSHPLIQRRM QDERTVLSVA RTVCEQCRLC TDLCPRHLIG HELSPHLLVR AVNFHQAATP QLLLSALTCS ECNICESVAC PVGISPMRIN RMLKRELRAQ NQRYEGPLYP ADEMAKYRLV PVKRLIAKLG LSPWYQEAPL VEEEPSVEKV TLQLRQHIGA SAVPTVAVGE RVTRGQCVAD VPPGALGASI HASIDGVVSA ISEQAITVVR G
|
| |
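The length fields in the Gene Information table can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS length includes the terminal stop codon. Neither assumption is stated in the record itself. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 2137417
END_BP = 2138772
REPORTED_GENE_LENGTH = 1356   # bp
REPORTED_PROTEIN_LENGTH = 451  # aa


def gene_length_from_coords(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_from_cds(cds_length: int) -> int:
    """Residue count of a CDS, assuming the last codon is a stop codon."""
    codons, remainder = divmod(cds_length, 3)
    if remainder != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # drop the stop codon


gene_length = gene_length_from_coords(START_BP, END_BP)
protein_length = protein_length_from_cds(gene_length)

print(gene_length, protein_length)  # prints: 1356 451
print(gene_length == REPORTED_GENE_LENGTH)        # prints: True
print(protein_length == REPORTED_PROTEIN_LENGTH)  # prints: True
```

Under these assumptions the computed values agree with the reported Gene Length and Protein Length.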