Gene Smed_3839 details


Gene Information

Locus tag: Smed_3839
Symbol:
ID: 5318567
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009620
Strand:
Start bp: 294180
End bp: 295163
Gene Length: 984 bp
Protein Length: 327 aa
Translation table: 11
GC content: 60%
IMG OID: 640775651
Product: TRAP transporter solute receptor TAXI family protein
Protein accession: YP_001312584
Protein GI: 150375988
COG category: [R] General function prediction only
COG ID: [COG2358] TRAP-type uncharacterized transport system, periplasmic component
TIGRFAM ID: [TIGR02122] TRAP transporter solute receptor, TAXI family
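
The coordinate and length fields above are consistent with each other, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length counts the stop codon; neither convention is stated in the record. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(294180, 295163)
print(length, protein_length(length))  # 984 327
```

Both values match the Gene Length and Protein Length fields of this record.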


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0135422
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGACATG AACGGATTAC GTTGCGAGGA GCGCAGATCG CCGCCCTCTC GGCTGCGATG 
TTTTTTTCGG GCAGCGCGGT GGCGCAGCAG AAATTCGTGA CGATCGGCAC TGGCGGCGTC
ACCGGCGTCT ATTATGCAGC GGGCGGCGCC ATCTGCCGGC TCCTGAACAA GGACCGCAAG
ACGCACGGAA TTCGCTGTTC GGTCGAATCC ACCGGTGGAT CGGCGTTCAA CGTGAATACG
ATCAAGGAAG GCGAGCTCGA TTTCGGCACG ACACAGTCCG ACGTCCAGTA CAACGCCATG
AAGGGCGAGG AATCGTTCAA GGAAGGCGGC GCGCATACCG ACCTCAGAGC GGTATTCTCG
ATCCATCCCG AACCATTCAC CGTACTGGCT CATCCGAATG CCGGCGTGAC CAGGTTCGAG
GATTTCAAGG GCAAGCGCTT CAATGTGGGC AATCCGGGCT CCGGCACGCG CGCTTCGATG
GAACGTCTGC TCGGCGCGAT GGGCTGGACG CTGGCCGACT TCTCCCTCGC ATCCGAGCTC
AAGGCCGATG AGCACGGGCC GGCGCTTTGC GACGGCAAGA TCGACGGCTT TTTCTACGGC
GTCGGACATC CCTCGGCCAA TATCCAGGAT CCGACGACCA CATGCGGCGC AAAGCTGGTG
CCGCTGACCG GCGAAGTCGT CGACAAACTG GTCGCCGAAA ACCCCTATTA TGCAAAGGCG
ACCATTCCCG GCGGCCTCTA CAACGGCAAT CCGGAAGACA CGGAAACGTT TGGCGTTCTT
GCCACGCTGG TCACATCGGC CAATGTACCG GAAGAGAGCG TTTACGAACT CACGAAAGCG
GTTTTCGAGA ACTTCGACGA GTTCAAGTCG CTGCACCCGG CCTTCGCCAA TCTCGATCCC
GCCAAGATGA TCAAGGATGG CCTCTCGGCG CCGCTGCACC CCGGCGCGGA AAGATACTAC
AAGGAAAAGG GCTGGCTGAA GTGA
 
Protein sequence
MRHERITLRG AQIAALSAAM FFSGSAVAQQ KFVTIGTGGV TGVYYAAGGA ICRLLNKDRK 
THGIRCSVES TGGSAFNVNT IKEGELDFGT TQSDVQYNAM KGEESFKEGG AHTDLRAVFS
IHPEPFTVLA HPNAGVTRFE DFKGKRFNVG NPGSGTRASM ERLLGAMGWT LADFSLASEL
KADEHGPALC DGKIDGFFYG VGHPSANIQD PTTTCGAKLV PLTGEVVDKL VAENPYYAKA
TIPGGLYNGN PEDTETFGVL ATLVTSANVP EESVYELTKA VFENFDEFKS LHPAFANLDP
AKMIKDGLSA PLHPGAERYY KEKGWLK
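
The gene and protein sequences above can be cross-checked by translating the nucleotide sequence. The following is a minimal sketch in plain Python, not the database's own method. It assumes the amino acid assignments of translation table 11, which are the same as those of the standard code, and it does not handle the alternative start codons that table 11 permits (this gene starts with ATG, so that is not needed here). The names `translate`, `gc_percent`, and `clean` are illustrative.

```python
BASES = "TCAG"
# Amino acid assignments in TCAG codon order; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate frame 1 of a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence listed above.
first_block = "ATGAGACATG AACGGATTAC GTTGCGAGGA"
print(translate(first_block))  # MRHERITLRG
```

Passing the full gene sequence to `translate` should reproduce the protein sequence listed above, and `gc_percent` on the same input can be compared against the GC content field.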