Gene SeD_A2005 details

Gene Information

Locus tag: SeD_A2005
Symbol:
ID: 6874663
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853
Kingdom: Bacteria
Replicon accession: NC_011205
Strand:
Start bp: 1937087
End bp: 1938067
Gene Length: 981 bp
Protein Length: 326 aa
Translation table: 11
GC content: 60%
IMG OID: 642785120
Product: vitamin B12-transporter permease
Protein accession: YP_002215786
Protein GI: 198244726
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG4139] ABC-type cobalamin transport system, permease component
TIGRFAM ID:
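
The start and end coordinates, gene length, and protein length listed above are mutually consistent, which a little arithmetic confirms. The following is a minimal sketch in Python, not part of the original record. The function names are illustrative, and it assumes inclusive 1-based coordinates and a single terminal stop codon that is not counted in the protein length.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given inclusive, 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(1937087, 1938067)
print(length)                  # 981
print(protein_length(length))  # 326
```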


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 63
Fosmid unclonability p-value: 0.509358
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTGACTT TTGCCCGCCA ACAACAGCGA CGAAACGTTC GCTGGCTTCT GAGCCTGTCA 
CTGCTGGTGC TACTGGCTAC ACTTCTGAGC TTATGCGCAG GCGAACAGTG GATTGCCCCC
GGTGACTGGT TAAGCGCCCG GGGGGAACTG TTTGTCTGGC AAATTCGCCT TCCCCGCACG
CTTGCGGTAT TGCTGGTTGG CGCTGCGCTG GCGCTATCTG GCGCCGTGAT GCAGGCGCTG
TTTGAAAACC CACTTGCTGA ACCGGGTCTG CTCGGCGTTT CGAATGGGGC CGGTGTTGGG
CTTATTGCCG CCGTCTTACT GGGGCAGGGG CAACTGCCAG GATGGGCGCT GGGACTGTGC
GCTATAGCCG GCGCGCTCAT TATTACGTTA ATCCTGCTGC GTTTTGCGCG TCGCCATCTT
TCTACCAGCC GCTTGTTGTT GGCGGGCGTC GCGCTGGGCA TTATCTGTAG CGCGCTGATG
ACGTGGGCTA TCTATTTTTC CACCTCTTTC GATCTGCGGC AATTAATGTA CTGGATGATG
GGAGGATTTG GCGGCGTTGA CTGGCAGCAG AGCTGGCTAA TGATTGCGCT CATCCCGGTA
CTGATCTGGA TATGTTGCCA GTCGCAACCG CTGAATATGC TGGCGCTAGG GGAAACCTCG
GCGCGGCAGC TTGGCCTGCC GCTGTGGTTC TGGCGCAATT TGTTGGTCGT CGCCACTGGC
TGGATGGTGG GCGTCAGCGT GGCGATGGCG GGGGCGATTG GTTTTATCGG TCTGGTTATT
CCGCACATCC TGCGCTTATG TGGTTTAACC GATCACCGGG TTTTACTTCC CGGCTGCGCG
CTGGCCGGGG CTATCGCCCT GCTATTGGCT GATGTGGTCG CCCGGCTGGC GCTGGCGTCG
GCTGAACTGC CTATCGGGGT GGTCACCGCC ACATTGGGGG CGCCAGTGTT TATCTGGCTG
CTGCTCAAAT CCGCGCGTTA G
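
The GC content reported in the gene information can be recomputed from a sequence laid out like the one above. This is a hedged sketch only: `gc_content` is a hypothetical helper, and it assumes the sequence is pasted as plain text in which spaces and line breaks merely group the bases.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases; whitespace between blocks is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First 10-base block of the gene sequence above: 4 of 10 bases are G or C.
print(gc_content("ATGCTGACTT"))  # 40.0
```

Passing the full gene sequence as a single multi-line string gives the percentage for the whole gene.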
 
Protein sequence
MLTFARQQQR RNVRWLLSLS LLVLLATLLS LCAGEQWIAP GDWLSARGEL FVWQIRLPRT 
LAVLLVGAAL ALSGAVMQAL FENPLAEPGL LGVSNGAGVG LIAAVLLGQG QLPGWALGLC
AIAGALIITL ILLRFARRHL STSRLLLAGV ALGIICSALM TWAIYFSTSF DLRQLMYWMM
GGFGGVDWQQ SWLMIALIPV LIWICCQSQP LNMLALGETS ARQLGLPLWF WRNLLVVATG
WMVGVSVAMA GAIGFIGLVI PHILRLCGLT DHRVLLPGCA LAGAIALLLA DVVARLALAS
AELPIGVVTA TLGAPVFIWL LLKSAR
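
The protein sequence corresponds to the gene sequence translated with translation table 11. The sketch below illustrates that relationship in plain Python. It is an assumption-laden example rather than the database's own method: `translate` is a hypothetical helper, the input is assumed to be an in-frame CDS with no ambiguous bases, and special handling of alternative start codons is omitted (this gene starts with ATG).

```python
from itertools import product

# Table 11 assigns the same amino acids to codons as the standard code;
# the two tables differ in which codons may initiate translation.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon."""
    bases = "".join(cds.split()).upper()
    if len(bases) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = []
    for i in range(0, len(bases), 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGCTGACTT TTGCCCGCCA ACAACAGCGA"))  # MLTFARQQQR
```

The output matches the first ten residues of the protein sequence, and translating the final line of the gene sequence (`CTGCTCAAAT CCGCGCGTTA G`) yields `LLKSAR`, the last six residues.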