Gene SeD_A4791 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A4791 
Symbol 
ID6873015 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp4644649 
End bp4645575 
Gene Length927 bp 
Protein Length308 aa 
Translation table11 
GC content51% 
IMG OID642787682 
Producthypothetical protein 
Protein accessionYP_002218276 
Protein GI198242021 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism
[R] General function prediction only 
COG ID[COG0697] Permeases of the drug/metabolite transporter (DMT) superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00231071 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones78 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATACCC AACGCCAGGC CTCCCCGTTT GCCCGCAAAA ACGTCGTTTA TGTGTGTGCC 
GCATTTTGTT GCTTGCTATG GGGCAGCGCT TATCCAGCCA TCAAAAGCGG TTATGACCTC
TTTCAGATAG CCACCGATGA TATCCCTTCT AAAATTGTTT TTGCTGGTTA TCGTTTTTTG
TTTGCGGGTG GGTTGCTACT ACTGTTCGCG CTGCTTCAGC GTAAACCGAT TGGTCGGTTT
CGTCCGCGCC AGTTTGCTCA GTTGACGTTA CTGGGGCTGA CTCAGACGTC GCTGCAATAT
CTCTTTTTCT ATATCGGCCT TGCGTTCACC TCCGGCGTGA AAGGCTCAAT CATGAACGCG
ACAGGCACAT TCTTCAGCGT ATTGCTGGCG CACTTTATTT ATCAGAACGA CCGATTGAGC
TACAACAAAA CGCTCGGCTG TATTCTGGGC TTTGCGGGCG TCATGGTGGT GAACGTCAGC
AACGGCCTGG ATTTCAGCTT TAATCTGCCG GGAGAAGGCT CCGTGGTGCT GGCGGCGTTT
ATTCTTTCTG CGGCCACATT GTATGGCAAA CGTCTCTCGC AGACGGTCGA TCCGATGGTC
ATGACTGGCT ATCAATTGGG GATTGGCGGT CTGGTACTGG TCATTGGCGG TTACGTTTTT
GGCGGTACGC TGACGATACA TGGCTTCTCG TCGGTGGCGA TTTTAGTCTA CCTGACGCTG
CTCTCGTCGG TCGCTTTTGC GCTATGGAGC ATTTTACTCA AATATAATCG CGTGGGGATG
ATTGCGCCGT TTAACTTTCT GATCCCGGTT TCTGGCGCGG CTCTTTCGGC TATTTTTCTC
GGCGAGAATA TATTGGAGTG GAAATACATG ATTGCGCTGG TGCTGGTGTG TTCGGGGATT
TGGTGGGTGA ATAAGGTGAA GCGGTAA
 
Protein sequence
MDTQRQASPF ARKNVVYVCA AFCCLLWGSA YPAIKSGYDL FQIATDDIPS KIVFAGYRFL 
FAGGLLLLFA LLQRKPIGRF RPRQFAQLTL LGLTQTSLQY LFFYIGLAFT SGVKGSIMNA
TGTFFSVLLA HFIYQNDRLS YNKTLGCILG FAGVMVVNVS NGLDFSFNLP GEGSVVLAAF
ILSAATLYGK RLSQTVDPMV MTGYQLGIGG LVLVIGGYVF GGTLTIHGFS SVAILVYLTL
LSSVAFALWS ILLKYNRVGM IAPFNFLIPV SGAALSAIFL GENILEWKYM IALVLVCSGI
WWVNKVKR