Gene SeD_A3116 details

Gene Information

Locus tag: SeD_A3116
Symbol:
ID: 6874832
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853
Kingdom: Bacteria
Replicon accession: NC_011205
Strand:
Start bp: 3001807
End bp: 3002871
Gene Length: 1065 bp
Protein Length: 354 aa
Translation table: 11
GC content: 59%
IMG OID: 642786140
Product: glycine betaine transporter membrane protein
Protein accession: YP_002216786
Protein GI: 198245962
COG category: [E] Amino acid transport and metabolism
COG ID: [COG4176] ABC-type proline/glycine betaine transport system, permease component
TIGRFAM ID:
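
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3001807
END_BP = 3002871
GENE_LENGTH_BP = 1065
PROTEIN_LENGTH_AA = 354


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def residues_from_cds(gene_length_bp: int) -> int:
    """Amino-acid count for a CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(residues_from_cds(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both comparisons hold for this record: 3002871 - 3001807 + 1 = 1065 bp, and 1065 / 3 - 1 = 354 aa.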


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 57
Fosmid unclonability p-value: 0.334495
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTGATC AAACGAATCC GTGGGATACC GCACAGGTGG CCGATACTAC GACGCAAACG 
GCTGATGCCT GGGGAACACC GGCAGGCGTA GCCACGGACG GCGGTAGTAC CGACTGGTTG
AACAGCGCGC CCGCGCCAGC CCCTGAACAC TTTTCTCTTC TGGACCCGTT CCATAAGACG
CTGATCCCGC TGGATAGCTG GGTCACAGAG GGAATCGACT GGGTCGTCAC CCATTTCCGT
CCCCTTTTTC AGGGGATTCG TGTGCCGGTG GATTACATCC TCAACGGCTT TCAGCAACTG
CTGCTGGGAA TGCCCGCCCC TGTGGCGATT ATTCTCTTTG CGCTGATTGC CTGGCAGGTT
TCCGGCGTGG GCATGGGGAT CGCGACGCTG ATATCGCTGA TCGCGATCGG CGCGATCGGT
GCCTGGTCGC AGGCGATGAT TACCCTGGCG CTGGTGTTGA CCGCCCTGTT GTTCTGCGTC
GTGATCGGAT TACCGATGGG AATCTGGCTG GCGCGCAGCC CGCGTGCGGC CAAAATAGTT
CGTCCGCTGC TGGATGCGAT GCAGACCACG CCCGCGTTTG TCTATCTGGT GCCGATTGTC
ATGTTATTCG GCATCGGTAA CGTGCCGGGC GTGGTGGTGA CGATTATTTT TGCTCTACCG
CCGATTGTAC GCCTGACGAT CTTGGGCATT AACCAGGTGC CTGCCGACTT AATTGAAGCG
TCGCGCTCGT TCGGCGCCAG CCCGCGCCAA ATGTTGTTCA AAGTGCAACT ACCGCTGGCG
ATGCCCACCA TTATGGCAGG CGTTAATCAG ACGCTGATGC TGGCTCTCTC GATGGTCGTC
ATCGCCTCGA TGATTGCGGT CGGCGGACTT GGCCAGATGG TACTACGCGG CATTGGTCGT
CTTGATATGG GGCTGGCAAC CGTCGGCGGC GTCGGCATTG TGATTCTCGC CATCATTCTG
GACCGTCTGA CGCAGGCCGT CGGGCGCGAC TCGCGTAGCC GCGGTAACCG TCGCTGGTAT
ACCACCGGTC CTGTTGGGCT AATCACCCGC CCTTTCGTTA AGTAA
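
The gene sequence is displayed in space-separated blocks of 10 bases, so it needs to be cleaned before any analysis. The following is a minimal sketch of how the blocks could be joined and a GC fraction computed for comparison with the GC content field of this record. The helper names are hypothetical, and the method the database uses to compute or round its GC value is not stated in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of bases that are G or C in the cleaned sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


if __name__ == "__main__":
    # First two blocks of the gene sequence, used only as a small example.
    first_blocks = "ATGGCTGATC AAACGAATCC"
    print(clean_sequence(first_blocks))
    print(gc_fraction(first_blocks))
```

Passing the full gene sequence text to `gc_fraction` gives a value that can be compared against the reported GC content.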
 
Protein sequence
MADQTNPWDT AQVADTTTQT ADAWGTPAGV ATDGGSTDWL NSAPAPAPEH FSLLDPFHKT 
LIPLDSWVTE GIDWVVTHFR PLFQGIRVPV DYILNGFQQL LLGMPAPVAI ILFALIAWQV
SGVGMGIATL ISLIAIGAIG AWSQAMITLA LVLTALLFCV VIGLPMGIWL ARSPRAAKIV
RPLLDAMQTT PAFVYLVPIV MLFGIGNVPG VVVTIIFALP PIVRLTILGI NQVPADLIEA
SRSFGASPRQ MLFKVQLPLA MPTIMAGVNQ TLMLALSMVV IASMIAVGGL GQMVLRGIGR
LDMGLATVGG VGIVILAIIL DRLTQAVGRD SRSRGNRRWY TTGPVGLITR PFVK