Gene SeD_A3117 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A3117 
SymbolproX 
ID6875441 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp3002941 
End bp3003936 
Gene Length996 bp 
Protein Length331 aa 
Translation table11 
GC content54% 
IMG OID642786141 
Productglycine betaine transporter periplasmic subunit 
Protein accessionYP_002216787 
Protein GI198244108 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2113] ABC-type proline/glycine betaine transport systems, periplasmic components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones56 
Fosmid unclonability p-value0.293613 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGACATA CCGTGATATT TGCCTCAGCG TTTGCCACCC TTGTCACCGC CAGCGCTTTC 
GCTGCCGACC TGCCTGGCAA AGGCATTACC GTCCAACCTA TCCAAAGCAC GATTTCTGAA
GAGACCTTCC AGACGTTACT GGTCAGTCGG GCGCTGGAAA AGTTGGGTTA TACCGTAAAT
AAGCCGAGTG AAGTCGATTA CAACGTAGGC TATACCTCCA TCGCCTCTGG CGATGCGACG
TTTACCGCCG TGAACTGGCA GCCGCTGCAT GATGATATGT ATGCCGCAGC AGGCGGAGAC
AACAAGTTTT ACCGCGAGGG CGTCTTCGTC TCCGGCGCTG CGCAGGGCTA TCTGATCGAT
AAAAAAACCG CCGAGCAGTA CAACATCACC AATATCGCTC AGCTAAAAGA TCCGAAAATC
GCCAAAATCT TCGATACCAA TGGCGACGGA AAAGCGGACA TGATGGGCTG CTCGCCGGGC
TGGGGCTGCG AAGCCGTGAT CAATCATCAG AATAAAGCGT TTGATCTGCA AAAAACCGTT
GAGGTCAGCC ACGGTAACTA TGCGGCGATG ATGGCTGATA CCATTACCCT TTTTAAAGAA
GGCAAGCCGG TGCTGTATTA CACCTGGACC CCGTACTGGG TGAGCGACGT AATGAAGCCG
GGTAAAGATG TCGTCTGGCT ACAGGTGCCG TTCTCCTCCC TGCCGGGCGA ACAGAAAAAT
ATTGATACTA AACTGCCGAA CGGCGCGAAC TATGGGTTCC CGGTCAATAC TATGCATATT
GTCGCCAATA AGGCATGGGC GGAGAAAAAC CCGGCGGCGG CGAAACTGTT CGCCATCATG
AAGCTGCCGC TGGCGGATAT CAACGCGCAG AACGCCATGA TGCATGCCGG TAAATCGTCT
GAAGCCGATG TTCAGGGCCA CGTAGACGGC TGGATCAACG CCCACCAGCA GCAGTTTGAC
GGCTGGGTGA AAGAAGCGCT GGCCGCGCAG AAATAA
 
Protein sequence
MRHTVIFASA FATLVTASAF AADLPGKGIT VQPIQSTISE ETFQTLLVSR ALEKLGYTVN 
KPSEVDYNVG YTSIASGDAT FTAVNWQPLH DDMYAAAGGD NKFYREGVFV SGAAQGYLID
KKTAEQYNIT NIAQLKDPKI AKIFDTNGDG KADMMGCSPG WGCEAVINHQ NKAFDLQKTV
EVSHGNYAAM MADTITLFKE GKPVLYYTWT PYWVSDVMKP GKDVVWLQVP FSSLPGEQKN
IDTKLPNGAN YGFPVNTMHI VANKAWAEKN PAAAKLFAIM KLPLADINAQ NAMMHAGKSS
EADVQGHVDG WINAHQQQFD GWVKEALAAQ K