Gene SeD_A3938 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A3938 
Symbol 
ID6874017 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp3774280 
End bp3775383 
Gene Length1104 bp 
Protein Length367 aa 
Translation table11 
GC content56% 
IMG OID642786897 
Productleucine-specific-binding protein 
Protein accessionYP_002217525 
Protein GI198243020 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones83 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATATGA AGGGTAAAAC GTTATTGGCA GGATGTATCG CCCTGTCATT AAGCCATATG 
GCATTCGCAG AGGACATTAA AGTCGCGGTT GTCGGCGCGA TGTCCGGTCC GGTGGCGCAG
TATGGCGACC AGGAATTTAC CGGTGCGGAG CAGGCAATTG CCGATATCAA TGCGAAAGGC
GGTATTAAAG GCGATAAGCT CGTCGCGGTG AAATATGACG ACGCCTGCGA CCCGAAACAG
GCGGTAGCTG TCGCTAACAA AGTGGTGAAC GACGGCATCA AGTATGTTAT CGGACACTTA
TGCTCCTCTT CCACACAGCC AGCGTCTGAT ATCTACGAAG ACGAAGGCAT CCTGATGATA
ACCCCGGCGG CGACCGCGCC GGAGCTGACC GCGCGCGGCT ATAAGCTGGT TTTGCGCACC
ACCGGTCTGG ACTCTGACCA GGGGCCAACG GCGGCAAAGT ATATTCTGGA GAAGGTAAAA
CCGCAGCGCA TCGCGATTAT CCACGATAAG CAGCAGTACG GCGAAGGGCT GGCGCGCGCG
GTGCAGGACG GTCTGAAGAA AGGCGGCGTT AACGTCGTAT TCTTTGACGG CATCACCGCC
GGCGAAAAAG ATTTCTCCAC TCTGGTAGCG CGTCTGAAAA AAGAGAATAT CGACTTTGTC
TACTACGGCG GTTATCACCC GGAAATGGGG CAGATCCTGC GTCAGTCTCG CGCCGCAGGG
CTGAAAACCC AGTTTATGGG GCCGGAAGGG GTGGCGAACG TGTCGCTGTC TAACATCGCC
GGAGAGTCGG CGGAAGGCTT ACTGGTCACC AAACCGAAGA ACTACGACCA GGTCCCGGCG
AACAAACCGA TTGTGGATGC TATCAAAGCC AAGAAACAAG ATCCTAGCGG CGCGTTTGTC
TGGACCACCT ACGCCGCGCT GCAATCGTTG CAGGCGGGGC TGAACCACTC CGACGATCCG
GCGGAAATCG CCAAATACCT GAAAGGCGCC ACGGTCGACA CCGTAATGGG ACCGCTGTCG
TGGGATGAGA AAGGCGATCT GAAAGGATTT GAGTTCGGCG TGTTTGACTG GCATGCGAAT
GGTACGGCGA CGGACGCCAA GTGA
 
Protein sequence
MNMKGKTLLA GCIALSLSHM AFAEDIKVAV VGAMSGPVAQ YGDQEFTGAE QAIADINAKG 
GIKGDKLVAV KYDDACDPKQ AVAVANKVVN DGIKYVIGHL CSSSTQPASD IYEDEGILMI
TPAATAPELT ARGYKLVLRT TGLDSDQGPT AAKYILEKVK PQRIAIIHDK QQYGEGLARA
VQDGLKKGGV NVVFFDGITA GEKDFSTLVA RLKKENIDFV YYGGYHPEMG QILRQSRAAG
LKTQFMGPEG VANVSLSNIA GESAEGLLVT KPKNYDQVPA NKPIVDAIKA KKQDPSGAFV
WTTYAALQSL QAGLNHSDDP AEIAKYLKGA TVDTVMGPLS WDEKGDLKGF EFGVFDWHAN
GTATDAK