Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SNSL254_A2406 |
Symbol | |
ID | 6483859 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | + |
Start bp | 2322705 |
End bp | 2323718 |
Gene Length | 1014 bp |
Protein Length | 337 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 642737745 |
Product | cobalamin synthesis protein P47K |
Protein accession | YP_002041487 |
Protein GI | 194444356 |
COG category | [R] General function prediction only |
COG ID | [COG0523] Putative GTPases (G3E family) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 0.000018505 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTTGTATC AATATTCTGG AGCTGACGTG ACCAAAACCA ATCTTATTAC TGGATTTCTC GGTAGTGGAA AAACCACCTC TATCCTTCAT TTATTAGCTC ATAAAGATCC GGCTGAAAAG TGGGCCGTCC TGGTTAATGA ATTTGGTGAA GTGGGTATTG ACGGCGCGCT GCTTGCCGAC AGCGGCGCAC TGCTAAAAGA GATCCCCGGC GGCTGCATGT GCTGCGTCAA TGGATTGCCT ATGCAGGTGG GGCTCAACAC GCTGCTGCGC CAGGGCAAAC CTGACCGGTT GCTGATTGAA CCAACCGGAC TGGGACACCC AAAACAGATT CTGGATTTAT TAACTGCGCC GGTTTATGAG CCGTGGATTG ATTTACGCGC CACGCTCTGC ATCCTTGACC CTCGCCTGCT ACTGGACCAA CAGAGCGTCG CCAATGAAAA TTTCCGCGAT CAGCTCGCCT CAGCCGATAT TATCATCGCC AATAAGACCG ATCGCGCCAC GGCGCAGAGC GATGCCGCCC TGCAACAGTG GTGGCGACAG TACGGCGGCG ATCGTCAACT GATTCATGCC GAACATGGAC AGATAGACGG TAAGCTTCTT GATTTACCAC GGCAAAATCT GGCGGAACTG CCGGCCAGCG CCGCGCATTC TCACACTCAT ACCAGTAAAA AAGGACTCGC CGCGCTAAAT CTGCCCGCCC AGCAGCGCTG GCGACGCAGC CTCAATAGCG GACAGGGTCA TCAGGCCTGC GGCTGGATTT TCGATGCCGA TACCGTGTTT GACACCATTG GCCTACTCGA ATGGGCGCGT CTGGCGCCGG TGGGCCGGGT GAAAGGCGTT ATGCGCATAC AAGAGGGGCT GGTACGCATC AATCGCCAGG GCGATGACCT GCACATCGAA ACACAGAGTG TCGCGCCGCC GGATAGCCGG GTTGAACTTA TCTCAAACAC AGAAACCGAC TGGAATACGT TACAGACGGC CTTGTTGAAG CTTCGTTTAG CGACGCACGC GTAA
|
Protein sequence | MLYQYSGADV TKTNLITGFL GSGKTTSILH LLAHKDPAEK WAVLVNEFGE VGIDGALLAD SGALLKEIPG GCMCCVNGLP MQVGLNTLLR QGKPDRLLIE PTGLGHPKQI LDLLTAPVYE PWIDLRATLC ILDPRLLLDQ QSVANENFRD QLASADIIIA NKTDRATAQS DAALQQWWRQ YGGDRQLIHA EHGQIDGKLL DLPRQNLAEL PASAAHSHTH TSKKGLAALN LPAQQRWRRS LNSGQGHQAC GWIFDADTVF DTIGLLEWAR LAPVGRVKGV MRIQEGLVRI NRQGDDLHIE TQSVAPPDSR VELISNTETD WNTLQTALLK LRLATHA
|
| |