Gene SeSA_A4663 details


Gene Information

Locus tag: SeSA_A4663
Symbol:
ID: 6516608
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Schwarzengrund str. CVM19633
Kingdom: Bacteria
Replicon accession: NC_011094
Strand:
Start bp: 4531047
End bp: 4531973
Gene Length: 927 bp
Protein Length: 308 aa
Translation table: 11
GC content: 52%
IMG OID: 642749602
Product: exported membrane protein
Protein accession: YP_002117335
Protein GI: 194737970
COG category: [E] Amino acid transport and metabolism
              [G] Carbohydrate transport and metabolism
              [R] General function prediction only
COG ID: [COG0697] Permeases of the drug/metabolite transporter (DMT) superfamily
TIGRFAM ID:
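
The coordinate and length fields above are mutually consistent, and the relationship can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and that the final codon is a stop codon that is not counted in the protein length. The function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(4531047, 4531973)
print(length_bp, protein_length(length_bp))
```

With the values in this record, the two results correspond to the Gene Length (927 bp) and Protein Length (308 aa) fields.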


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00303431
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATACCC AACGCCAGGC CTCCCCGTTT GCCCGCAAAA ACGTCGTTTA TGTGTGTGCC 
GCATTTTGTT GCCTGCTATG GGGCAGCGCT TATCCAGCCA TCAAAAGCGG TTATGACCTC
TTTCAGATAG CCACCGACGA TATCCCTTCT AAAATTGTTT TTGCTGGTTA TCGTTTTTTG
TTTGCGGGTG GGTTGCTGCT ACTGTTCGCG CTGCTTCAGC GTAAACCGAT TGGTCGGTTT
CGTCCGCGCC AGTTTGCTCA GTTAACGTTA CTGGGGCTGA CTCAGACGTC GCTACAATAT
CTCTTTTTCT ATATCGGCCT TGCGTTCACC TCCGGCGTGA AAGGCTCAAT CATGAACGCG
ACAGGCACAT TCTTCAGCGT ATTGCTGGCG CACTTTATTT ATCAGAACGA CCGATTGAGC
TACAACAAAA CGCTCGGCTG TATTCTGGGC TTTGCGGGCG TCATGGTGGT GAACGTCAGC
AACGGCCTGG ATTTCAGCTT TAATCTGCCG GGAGAAGGCT CCGTGGTGCT GGCGGCGTTT
ATTCTTTCTG CGGCCACGTT GTATGGCAAA CGTCTCTCGC AGACGGTCGA TCCGATGGTC
ATGACTGGCT ATCAATTGGG GATTGGCGGT CTGGTACTGG TCATTGGCGG CTACGTTTTT
GGCGGTACGC TGACGATACA TGGCTTCTCG TCGGTGGCGA TTTTAGTCTA CCTGACGTTG
CTCTCGTCGG TCGCTTTTGC GCTATGGAGC ATTTTACTCA AATATAATCG CGTGGGGATG
ATTGCGCCGT TTAACTTTCT GATCCCGGTT TCCGGCGCGG CTCTTTCGGC TATTTTCCTC
GGCGAGAATA TTCTGGAGTG GAAATACATG ATTGCGCTGG TGCTGGTGTG TTCGGGGATC
TGGTGGGTGA ATAAGGTGAA GCGGTAA
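
A GC content figure such as the one in the Gene Information section can in principle be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted as shown, in 10-base blocks separated by whitespace. `gc_percent` is a hypothetical helper, and how the source database rounds its value is not stated in this record.

```python
def gc_percent(dna: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Demonstration on the first two 10-base blocks of the gene sequence only;
# pass the full sequence to compare against the reported GC content.
print(gc_percent("ATGGATACCC AACGCCAGGC"))
```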
 
Protein sequence
MDTQRQASPF ARKNVVYVCA AFCCLLWGSA YPAIKSGYDL FQIATDDIPS KIVFAGYRFL 
FAGGLLLLFA LLQRKPIGRF RPRQFAQLTL LGLTQTSLQY LFFYIGLAFT SGVKGSIMNA
TGTFFSVLLA HFIYQNDRLS YNKTLGCILG FAGVMVVNVS NGLDFSFNLP GEGSVVLAAF
ILSAATLYGK RLSQTVDPMV MTGYQLGIGG LVLVIGGYVF GGTLTIHGFS SVAILVYLTL
LSSVAFALWS ILLKYNRVGM IAPFNFLIPV SGAALSAIFL GENILEWKYM IALVLVCSGI
WWVNKVKR
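
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below builds the codon table from the conventional TCAG-ordered amino acid string, which is the same for translation table 11 as for the standard code (the two differ only in the permitted start codons). Alternative start codons are not handled here, which is an assumption that is harmless for this gene because it begins with ATG. `translate` is an illustrative helper, not a library function.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str, to_stop: bool = True) -> str:
    """Translate a DNA string, ignoring whitespace; stop codons map to '*'."""
    seq = "".join(dna.split()).upper()
    protein = []
    # Step through complete codons only; a trailing partial codon is dropped.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGGATACCC AACGCCAGGC CTCCCCGTTT"))
```

Applied to the first 30 bases, this yields MDTQRQASPF, the first block of the protein sequence above.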