Gene SNSL254_A2450 details


Gene Information

Locus tag: SNSL254_A2450
Symbol:
ID: 6484631
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Newport str. SL254
Kingdom: Bacteria
Replicon accession: NC_011080
Strand:
Start bp: 2366039
End bp: 2367091
Gene Length: 1053 bp
Protein Length: 350 aa
Translation table: 11
GC content: 57%
IMG OID: 642737787
Product: thiamine biosynthesis lipoprotein ApbE
Protein accession: YP_002041528
Protein GI: 194446152
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG1477] Membrane-associated lipoprotein involved in thiamine biosynthesis
TIGRFAM ID:
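
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 2366039
END_BP = 2367091
GENE_LENGTH_BP = 1053
PROTEIN_LENGTH_AA = 350


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS, minus one for the stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # 2367091 - 2366039 + 1 = 1053
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    # 1053 / 3 = 351 codons, one of which is the stop codon
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both checks agree with the listed Gene Length and Protein Length.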


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 51
Fosmid unclonability p-value: 0.0247044
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAATGA CTTTTTGCCG GGCCGTGTGT CTGGCGGCGG CTTTTTTACT TATGGGCTGC 
GATGAGGCTC CCGAAACGAC AACAGCGTCA CCTGCCGCTC AGGTGCTGGA AGGTAAAACG
ATGGGGACCC TCTGGCGGGT GAGCGTGGTT GGTATCGATG CGAAACGCGC CGCAGAGTTA
CAGACTAAAA TCCAGACTCA GCTTGATGCT GATGATTGGT TGCTTTCTAC CTATAAAAAT
GACTCCGCGT TGATGCGTTT TAACCATTCA CGCAGTCTTG CGCCCTGGCC GGTCAGCGAA
GCCATGGCGG ATATCGTGAC CTCGGCGCTG CGTATTGGCG CGAAGACGGA CGGCGCGATG
GATATCACCG TGGGCCCGCT GGTCAATCTG TGGGGGTTTG GGCCGGATCG GCAGCCGCTG
CATATCCCAA CACCAGCACA AATCGATGCG GCAAAAGCGA AAACAGGCCT GCAACATTTG
CAGGTTATCG ACAGGGCTGG ACATCAGTTT TTGCAAAAAG ATCTGCCGGA TCTTTATGTT
GATCTCTCCA CGGTCGGGGA GGGCTATGCG GCGGATCACC TGGCGCGTCT GATGGAGCAG
GAGGGCATTG CGCGTTATCT GGTCTCGGTG GGCGGCGCAT TAAGCAGTCG CGGGATGAAT
GCGCAAGGGC TGCCGTGGCG CGTCGCGATT CAGAAGCCGA CCGACCGGGA AAACGCGGTG
CAGGCGATTG TGGATATCAA CGGGCATGGC ATCAGCACCT CCGGCAGCTA CCGTAACTAT
TATGAGCTGG ATGGCAAGCG CGTATCGCAC GTTATCGATC CGCAAACGGG GCGCCCCATT
GAACACAACC TGGTATCGGT TACGGTCATC GCGCCAACGG CGCTGGAAGC GGACGGCTGG
GACACCGGCC TGATGGTGCT TGGTACGCAA AAGGCGCAAG AGGTCGTGCG GCGGGAAGGG
CTGGCGGTCT TTATGATCAT GAAAGAAGGT GAAGGCTTTA AAACCTGGAT GTCGCCGCAG
TTCAAAACGT TCCTGGTGAG CGATAAGAAT TAA
 
Protein sequence
MKMTFCRAVC LAAAFLLMGC DEAPETTTAS PAAQVLEGKT MGTLWRVSVV GIDAKRAAEL 
QTKIQTQLDA DDWLLSTYKN DSALMRFNHS RSLAPWPVSE AMADIVTSAL RIGAKTDGAM
DITVGPLVNL WGFGPDRQPL HIPTPAQIDA AKAKTGLQHL QVIDRAGHQF LQKDLPDLYV
DLSTVGEGYA ADHLARLMEQ EGIARYLVSV GGALSSRGMN AQGLPWRVAI QKPTDRENAV
QAIVDINGHG ISTSGSYRNY YELDGKRVSH VIDPQTGRPI EHNLVSVTVI APTALEADGW
DTGLMVLGTQ KAQEVVRREG LAVFMIMKEG EGFKTWMSPQ FKTFLVSDKN
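
The relationship between the gene sequence and the protein sequence above can be sketched in a few lines of Python. This is a minimal illustration, not the pipeline that produced this record: it builds the codon assignments shared by NCBI translation tables 1 and 11, strips the spacing used in the listing, and translates up to the first stop codon. It does not handle the alternative start codons that table 11 permits, which is sufficient here because the gene begins with ATG. The helper names (`clean`, `translate`, `gc_content`) are hypothetical.

```python
BASES = "TCAG"
# Amino acids in TCAG codon order (same assignments in tables 1 and 11).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the listing; uppercase."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    # First line of the gene sequence listing above.
    first_line = clean(
        "ATGAAAATGA CTTTTTGCCG GGCCGTGTGT CTGGCGGCGG CTTTTTTACT TATGGGCTGC"
    )
    print(translate(first_line))  # MKMTFCRAVCLAAAFLLMGC
```

Applying `clean` and `translate` to the full gene sequence should reproduce the protein sequence listed above, and `gc_content` on the cleaned gene sequence can be compared against the GC content field.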