Gene Smed_3302 details

Gene Information

Locus tag: Smed_3302
Symbol:
ID: 5324186
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009636
Strand:
Start bp: 3492854
End bp: 3493834
Gene Length: 981 bp
Protein Length: 326 aa
Translation table: 11
GC content: 64%
IMG OID: 640792254
Product: oligopeptide/dipeptide ABC transporter, ATPase subunit
Protein accession: YP_001328959
Protein GI: 150398492
COG category: [E] Amino acid transport and metabolism
COG ID: [COG4608] ABC-type oligopeptide transport system, ATPase component
TIGRFAM ID: [TIGR01727] oligopeptide/dipeptide ABC transporter, ATP-binding protein, C-terminal domain
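
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 3492854
end_bp = 3493834

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and yields no amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 981
print(protein_length)  # 326
```

Under these assumptions the computed values agree with the listed Gene Length (981 bp) and Protein Length (326 aa).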


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACAATA CGGCGCCGCT TCTCGACGTC AGCGGGCTCA CCAAGCGCTT TTCGGTCAAG 
GGCCGCGGCC CCAAGGGTGA AAAACGCTTC GTGCATGCCG TCGATAATGT CAGCTTTACC
ATTGCCCGCG GCGAAGTCGT CGGTCTTGTG GGGGAATCGG GCTCCGGCAA AACCACGATC
GGCCGCACGC TGATGCGGCT GACCGATCCG ACAGCAGGCG TGATCGCCTA TGAAGGCACG
GATATCGCCC ATCTCCCCGC CCGCGAGATG ATGGCGTTCC GGCGCAAGAT CCAGATGGTG
TTCCAGGATC CGTTCGCCAG CCTCAACCCG CGACGCAAGG TCGGGCAGTT GATTGCGGAG
GGCATGGAAA TCCACAAACT CGGCACCCGC CAGAAGCAGG ATGCGGAGGT CAAGCGGCTG
CTGACGCTGG TCGGACTGCC GGCCGATGCC GGCCAGCGCT TCCCGCACGA GTTTTCCGGC
GGCCAGCGCC AGCGCATCGG CATTGCACGC GCGCTCGCCG TAGCACCGGG CTTCATCGTC
GCAGACGAGC CGGTGTCGGC GCTTGACGTG TCGGTGCAGG CGCAGGTTCT CAACCTTTTG
CAGGACCTCA AGGAACAGCT TGGCCTGACC ATCCTGTTCA TCTCTCACGA TCTGGCCGTG
GTGGAGCATT TCTGCGACCG GGTGATCGTG CTGTATCTCG GCCGTATCAT GGAAATCGCC
CCGTGCCACC GGCTCTATTC CAAGCCGGCG CACCCCTATA CCGAGGCCCT TCTATCCGCG
GCTCCCGTCC CCGATCCGGA TCGGAAGGGC GACCGCATCG TTCTCGAAGG CGATATCCCG
AGCCCTATCG ATCCACCCTC CGGATGCGTG CTCAGAACCC GCTGTCGCTA CGCGCTGCCC
GACTGCGCCG CAGTCCGCCC CGAGCTGCGC GAAGTGGCGC CCAACCATTT CAAGGCCTGT
CTGCGCGACG ACATTCTTTA G
 
Protein sequence
MNNTAPLLDV SGLTKRFSVK GRGPKGEKRF VHAVDNVSFT IARGEVVGLV GESGSGKTTI 
GRTLMRLTDP TAGVIAYEGT DIAHLPAREM MAFRRKIQMV FQDPFASLNP RRKVGQLIAE
GMEIHKLGTR QKQDAEVKRL LTLVGLPADA GQRFPHEFSG GQRQRIGIAR ALAVAPGFIV
ADEPVSALDV SVQAQVLNLL QDLKEQLGLT ILFISHDLAV VEHFCDRVIV LYLGRIMEIA
PCHRLYSKPA HPYTEALLSA APVPDPDRKG DRIVLEGDIP SPIDPPSGCV LRTRCRYALP
DCAAVRPELR EVAPNHFKAC LRDDIL
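
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the sequence is given in coding orientation and in frame, and it uses the standard codon assignments, which translation table 11 shares with the standard code for internal codons (table 11 differs in which codons may act as start codons; this gene starts with ATG, so that difference does not matter here). The function names `clean`, `translate`, and `gc_percent` are hypothetical helpers, not part of any database tool.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":  # stop codon
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in a non-empty sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First and last lines of the gene sequence above (both start in frame,
# because every full line holds 60 bases).
first_line = "ATGAACAATA CGGCGCCGCT TCTCGACGTC AGCGGGCTCA CCAAGCGCTT TTCGGTCAAG"
last_line = "CTGCGCGACG ACATTCTTTA G"

print(translate(first_line))  # MNNTAPLLDVSGLTKRFSVK
print(translate(last_line))   # LRDDIL
```

The two printed fragments match the start and end of the protein sequence listed above. Passing the full 981 bp sequence to `gc_percent` would be one way to check the listed GC content.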