Gene Information |
Locus tag | SNSL254_A2450 |
Symbol | |
ID | 6484631 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | - |
Start bp | 2366039 |
End bp | 2367091 |
Gene Length | 1053 bp |
Protein Length | 350 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 642737787 |
Product | thiamine biosynthesis lipoprotein ApbE |
Protein accession | YP_002041528 |
Protein GI | 194446152 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG1477] Membrane-associated lipoprotein involved in thiamine biosynthesis |
TIGRFAM ID | |
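The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (the record does not say so explicitly) and that the gene length includes the stop codon; the function names are hypothetical and are not part of the database.

```python
# Values copied from the Gene Information table above.
START_BP = 2366039
END_BP = 2367091


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length)                  # 1053, matching "Gene Length"
    print(protein_length(length))  # 350, matching "Protein Length"
```

Under these assumptions, both computed values agree with the Gene Length and Protein Length rows of the record.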
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 51 |
Fosmid unclonability p-value | 0.0247044 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAAAATGA CTTTTTGCCG GGCCGTGTGT CTGGCGGCGG CTTTTTTACT TATGGGCTGC GATGAGGCTC CCGAAACGAC AACAGCGTCA CCTGCCGCTC AGGTGCTGGA AGGTAAAACG ATGGGGACCC TCTGGCGGGT GAGCGTGGTT GGTATCGATG CGAAACGCGC CGCAGAGTTA CAGACTAAAA TCCAGACTCA GCTTGATGCT GATGATTGGT TGCTTTCTAC CTATAAAAAT GACTCCGCGT TGATGCGTTT TAACCATTCA CGCAGTCTTG CGCCCTGGCC GGTCAGCGAA GCCATGGCGG ATATCGTGAC CTCGGCGCTG CGTATTGGCG CGAAGACGGA CGGCGCGATG GATATCACCG TGGGCCCGCT GGTCAATCTG TGGGGGTTTG GGCCGGATCG GCAGCCGCTG CATATCCCAA CACCAGCACA AATCGATGCG GCAAAAGCGA AAACAGGCCT GCAACATTTG CAGGTTATCG ACAGGGCTGG ACATCAGTTT TTGCAAAAAG ATCTGCCGGA TCTTTATGTT GATCTCTCCA CGGTCGGGGA GGGCTATGCG GCGGATCACC TGGCGCGTCT GATGGAGCAG GAGGGCATTG CGCGTTATCT GGTCTCGGTG GGCGGCGCAT TAAGCAGTCG CGGGATGAAT GCGCAAGGGC TGCCGTGGCG CGTCGCGATT CAGAAGCCGA CCGACCGGGA AAACGCGGTG CAGGCGATTG TGGATATCAA CGGGCATGGC ATCAGCACCT CCGGCAGCTA CCGTAACTAT TATGAGCTGG ATGGCAAGCG CGTATCGCAC GTTATCGATC CGCAAACGGG GCGCCCCATT GAACACAACC TGGTATCGGT TACGGTCATC GCGCCAACGG CGCTGGAAGC GGACGGCTGG GACACCGGCC TGATGGTGCT TGGTACGCAA AAGGCGCAAG AGGTCGTGCG GCGGGAAGGG CTGGCGGTCT TTATGATCAT GAAAGAAGGT GAAGGCTTTA AAACCTGGAT GTCGCCGCAG TTCAAAACGT TCCTGGTGAG CGATAAGAAT TAA
Protein sequence | MKMTFCRAVC LAAAFLLMGC DEAPETTTAS PAAQVLEGKT MGTLWRVSVV GIDAKRAAEL QTKIQTQLDA DDWLLSTYKN DSALMRFNHS RSLAPWPVSE AMADIVTSAL RIGAKTDGAM DITVGPLVNL WGFGPDRQPL HIPTPAQIDA AKAKTGLQHL QVIDRAGHQF LQKDLPDLYV DLSTVGEGYA ADHLARLMEQ EGIARYLVSV GGALSSRGMN AQGLPWRVAI QKPTDRENAV QAIVDINGHG ISTSGSYRNY YELDGKRVSH VIDPQTGRPI EHNLVSVTVI APTALEADGW DTGLMVLGTQ KAQEVVRREG LAVFMIMKEG EGFKTWMSPQ FKTFLVSDKN
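The gene sequence as listed starts with ATG and ends with TAA, so it appears to be given in coding orientation even though the gene lies on the minus strand. The sketch below shows one way to strip the 10-base grouping, compute a GC percentage, and translate the reading frame. It is an illustration and not the database's own pipeline: the helper names are hypothetical, and the codon table covers only the amino-acid assignments that translation table 11 shares with the standard code. The alternative start codons of table 11 are not handled, which does not matter here because the gene begins with ATG.

```python
from itertools import product

# Codon-to-amino-acid assignments in TCAG order. Translation table 11 uses
# the same assignments as the standard code; it differs in its start codons,
# which this sketch does not model.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace grouping used in the record and upper-case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon.

    Raises KeyError if a codon contains a character other than A, C, G, T.
    """
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three 10-base groups of the gene sequence above.
    head = "ATGAAAATGA CTTTTTGCCG GGCCGTGTGT"
    print(translate(head))  # MKMTFCRAVC, the first 10 residues of the protein
```

Passing the full gene sequence to `translate` and `gc_percent` would allow a comparison with the Protein sequence and GC content rows of the record.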