Gene Smed_5237 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_5237 
Symbol 
ID5319539 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009621 
Strand
Start bp198075 
End bp199619 
Gene Length1545 bp 
Protein Length514 aa 
Translation table11 
GC content59% 
IMG OID640777014 
ProductAbgT putative transporter 
Protein accessionYP_001313946 
Protein GI150377351 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG2978] Putative p-aminobenzoyl-glutamate transporter 
TIGRFAM ID[TIGR00819] p-Aminobenzoyl-glutamate transporter family 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCACGG CGGGAGCAGT TCCGAAAACC GTGATGCAAA GGTTCCTGGA CGGCGTCGAA 
AAGGTGGGCA ACATGGTCCC CCACCCCGTC ATAATCTTCC TTATTCTGAT TGCCGGGGTA
ATCGCGCTGT CAGCGGTTCT GGGTATTTTT GGGACCTCGG TGACCTTCGA GCGGATCAAT
CCGGATACCC ATGAGCTCGA GTCCGCAACG ACTTCAATAA GAAGCCTGTT GAATACCGAC
GGCATTCGCT TCATGTATGT GTCGCTTATC CCTAACTTCA TGAGCTTCAC TGCAGTCGGT
CTCATGATTG CCGCGATGAT CGGCGCCGGC GTCGCCGAGG AGTCCGGGCT CGTTACAGCG
CTTATCCGGA AATTGGTGAT CGTGTCTCCG CGCTGGGCCT TGACCTACAT TCTCGCTTTC
GTTGGAATTA TCGCGAGTGT TGCCGCGGAC GCGGGTTATC TGGTTCTGAT CCCTCTTGCG
GGTATTGCCT ACCTGGCGGT GGGGCGTCAT CCGCTTGCAG GTCTCGCACT GGGGTTTGCC
GCGGTGGCCG GCGCTTTCAC GGTCAACATG CTGATCAAGC CGCTTGATGC GGTTCTTGTG
GAGTTCACCA ATGACGCCGC CCATCTGGTC GATGCGAACA GGTCGATTGG GCTGGCGTCC
AACATCTGGT TCTCCATAGC ATCGGTCCTG TTTCTGACCG GCGTGATCGC CTTCGTCTCA
GACCGCATGA TCGAACCCAG ACTTGGGACC TATGTCCCCG ACGAAGACGC TGAGCGCGCG
AACGAAGGAG CCGCGCTCTC AGCGTCTGAA TCGCGCGGGC TAAGGTTCGC GTCCTTCGGT
CTGATCGGCC TCGTGATCGT GTTCTGCCTC CTGACGCTGC CCGGTGGCGC ACCGCTCCGG
AACGCCGAAA CCGGCGAGCT GATCGGCAAC TCGCCCTTCA TGAACGGCCT CATCGCATTG
ATCATGCTTG TCTTCCTCGT CTCCGGGTGG TGCTACGGCA TCGGGGCCGG GACCCTGCGG
ACCCTTACCG AGGTGATCAC GGCCGTCGAG AAATCCATCA GAAATCTCGG CGGTACGATC
TTCCTGTTCT TCGTGCTGAG CCAGTTCGTC GCCTACTTCA CCTATACCAA CATGGGCACC
GTCATGGCGT TGAGCCTTTC CAGCGCGCTG CAGGCCGCCA ATATCGGCGC CCTGCCCCTG
TTACTCGGCT TCATCATCGT CGTTGCGATC ATCGATCTCC TGTTGACCGG CGCAATCGCA
AAATGGGCGA TATTCGCGCC AGTCTTCGTG CCGCTCCTGA TGAAGCTCGG TGTTGAGCCG
GAGGCGGTCC TGGCTGCTTA CCGGGTCGGG GATTCTCCGA TGAACGCCAT CACCCCGCTC
AACGCCTACT TCGCTTTGGT CGTAGGGTTC GCCCAGAAAT ACGACAGGTC GGCTGGCGTC
GGGACAATAG TGTCACTGAT GCTGCCTTAC GTGATCTGGA TGTTCGTGTT GTGGACCTTG
CTGTTCGCGG TCTGGAAGAT GGTTGGACTT CCTTGGGGAC TGTAG
 
Protein sequence
MSTAGAVPKT VMQRFLDGVE KVGNMVPHPV IIFLILIAGV IALSAVLGIF GTSVTFERIN 
PDTHELESAT TSIRSLLNTD GIRFMYVSLI PNFMSFTAVG LMIAAMIGAG VAEESGLVTA
LIRKLVIVSP RWALTYILAF VGIIASVAAD AGYLVLIPLA GIAYLAVGRH PLAGLALGFA
AVAGAFTVNM LIKPLDAVLV EFTNDAAHLV DANRSIGLAS NIWFSIASVL FLTGVIAFVS
DRMIEPRLGT YVPDEDAERA NEGAALSASE SRGLRFASFG LIGLVIVFCL LTLPGGAPLR
NAETGELIGN SPFMNGLIAL IMLVFLVSGW CYGIGAGTLR TLTEVITAVE KSIRNLGGTI
FLFFVLSQFV AYFTYTNMGT VMALSLSSAL QAANIGALPL LLGFIIVVAI IDLLLTGAIA
KWAIFAPVFV PLLMKLGVEP EAVLAAYRVG DSPMNAITPL NAYFALVVGF AQKYDRSAGV
GTIVSLMLPY VIWMFVLWTL LFAVWKMVGL PWGL