Gene SeD_A2609 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A2609 
Symbol 
ID6873157 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp2485429 
End bp2486481 
Gene Length1053 bp 
Protein Length350 aa 
Translation table11 
GC content57% 
IMG OID642785679 
Productthiamine biosynthesis lipoprotein ApbE 
Protein accessionYP_002216336 
Protein GI198242105 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG1477] Membrane-associated lipoprotein involved in thiamine biosynthesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones61 
Fosmid unclonability p-value0.425275 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAATGA CTTTTTGCCG GGCCGTGTGT CTGGCGGCGG CTTTTCTACT TATGGGCTGC 
GATGAGGCTC CCGAAACGAC AACAGCGTCA CCTGCCGCTC AGGTGCTGGA AGGTAAAACG
ATGGGGACCC TCTGGCGGGT GAGCGTGGTT GGTATCGATG CGAAACGCGC CGCAGAGTTA
CAGACTAAAA TCCAGACTCA GCTTGATGCT GATGATTGGT TGCTTTCTAC CTATAAAAAT
GACTCCGCGC TGATGCGTTT TAACCATTCA CGCAGCCTTG CGCCCTGGCC GGTCAGCGAA
GCCATGGCGG ATATCGTGAC CTCGGCGCTA CGTATTGGCG CGAAGACGGA CGGCGCGATG
GATATCACCG TGGGCCCGCT GGTCAATCTG TGGGGGTTTG GGCCGGATCG GCAGCCGATG
CATATCCCAA CACCAGCACA AATCGATGCG GCAAAAGCGA AAACAGGCCT GCAACATTTG
CAGGTTATCG ACAGGGCTGG ACATCAGTTT TTGCAAAAAG ATCTGCCGGA TCTTTATGTT
GATCTCTCCA CGGTCGGGGA GGGCTATGCG GCGGATCATC TGGCGCGACT GATGGAGCAG
GAGGGCATTG CGCGTTATCT GGTCTCGGTG GGGGGCGCAT TAAGCAGCCG CGGGATGAAT
GCGCAGGGGC AGCCGTGGCG CGTCGCGATT CAGAAGCCGA CCGACCGGGA AAACGCGGTG
CAGGCGATTG TGGATATCAA CGGGCATGGC ATCAGCACCT CCGGCAGCTA CCGTAACTAT
TATGAGCTGG ATGGCAAGCG TATCTCGCAC GTTATCGATC CGCAAACGGG GCGCCCCATT
GAACACAACC TGGTATCGGT TACGGTCATC GCGCCAACGG CGCTGGAAGC GGACGGCTGG
GACACCGGCC TGATGGTGCT CGGTACGCAA AAGGCGCAAG AGGTCGTGCG GCGGGAAGGG
CTGGCGGTCT TTATGATCAT GAAAGAAGGT GAAGGCTTTA AAACCTGGAT GTCGCCGCAG
TTCAAAACGT TCATGGTGAG CGATAAGAAT TAA
 
Protein sequence
MKMTFCRAVC LAAAFLLMGC DEAPETTTAS PAAQVLEGKT MGTLWRVSVV GIDAKRAAEL 
QTKIQTQLDA DDWLLSTYKN DSALMRFNHS RSLAPWPVSE AMADIVTSAL RIGAKTDGAM
DITVGPLVNL WGFGPDRQPM HIPTPAQIDA AKAKTGLQHL QVIDRAGHQF LQKDLPDLYV
DLSTVGEGYA ADHLARLMEQ EGIARYLVSV GGALSSRGMN AQGQPWRVAI QKPTDRENAV
QAIVDINGHG ISTSGSYRNY YELDGKRISH VIDPQTGRPI EHNLVSVTVI APTALEADGW
DTGLMVLGTQ KAQEVVRREG LAVFMIMKEG EGFKTWMSPQ FKTFMVSDKN