Gene Smed_3842 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_3842 
Symbol 
ID5318570 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp299046 
End bp300395 
Gene Length1350 bp 
Protein Length449 aa 
Translation table11 
GC content62% 
IMG OID640775654 
Productxanthine/uracil/vitamin C permease 
Protein accessionYP_001312587 
Protein GI150375991 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG2233] Xanthine/uracil permeases 
TIGRFAM ID[TIGR00801] uracil-xanthine permease 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.899468 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGGGA AAAAAATCGA TTCTATCGAC CCGACAGACC AAGCGCTGCC ACCACGCAGC 
CTGATCCTGT TCGGTCTGCA GCATGTGCTG GTAATGGCGG CGTCACCGAT AACCGCCGTG
TTTCTCGTAA GCAAGGCGCT CGGGTTTTCC GATGCGCTTA CGGTATCGCT GATCAGCGCG
ACATTTCTGA TCTGTGGTTT GGGGACAATC CTGCAGAGCT TCGGCCCGGC GGGTTTCGGT
GCGCGACTGC CCTTTATCAT GGTGCCGGGC GGGGCGCCGA TTGCGATCTT TCTCGCTATC
GCCCAGCAAA CCGACATACA GACGGCAGTC GGCGCGGTGA TCCTCACGGC CGGCTTCTAT
TTCCTGGCGC TGCCGGTATT CCGGCGGCTG CTGCGCTATT TTCCGCCCAT CGTGGTCGGC
ACAATGCTCC TGCTCGTGTC GGTGAACCTC GTTCGCATCT ACGGCGGTAC GATCACCGGG
AAACAGGGGA GCGAGGGTTT TGCCGATCCG ATGAATGTCG GGCTTGCCCT TGCGACGATC
GCCCTGACGG TGATCTTCGC CAGGATTTTT ACAGGCACGT TTCAGCGGAT TTCGGTGATG
CTCGGGCTCA TAGCAGGTTC GATGATCGCC TTTGGAGCCG GCTATATGGA CCTCTCCGGC
ATCTTCGACG GACCGGTCAT TGCCGTGCCC GCGCTTCTTC CGTTCGGGAT GCCGAAGTTC
GACATCTTTG CCGCCCTCCC GCTCATCGTG TTTTCCATCA TATCGATGGC CGAAGCGACG
GGCCAGACCA TCGCCACTGC CGAGATCGTC GGGCGTCGCG GCGATGCGCA CGCAATCGTG
CCAGCGACCA TCCGCGGCGA TGCCGTCGCC TCGCTTGTGG GCGGCCTGTT CGGAACATCG
CTGATCATCA CCAGCGGCGA AAACGTCGGC ATTGTCCGGG CGACCAACGT GAAGTCGCGT
TACGTCACCG CAATGGCTGG CGTGATCCTG GTCCTCATTG CCCTGCTTGC GCCGGTCGGT
CGGCTGGCCA ATGCCCTGCC CGGCCCTGTC GTCGGCGGAA CCGCGGTGAT CGTGTTCTCG
ATCATCGGCG TCATCGGGAT CGATCTCCTG CGTCGCGTGG ACCTGCGCGA GCATGGCCCG
ATGTTCACAC TGGCGGCGGC ACTATCCATG GGCCTGCTGC CTATCCTTGT TCCTGGCGTC
TACAGCCAGT TTCCGCAGTG GAGCCAGATG ATCCTCGCCA ATGGCCTTGC CGCCGGCACG
ATCACGGCCG TGATCGTCAA CGCTTTCTTC CAACACATGC CCTCCGGCTC GGCTCAAAAG
GCCGCCGCCG GCGTCGAGGC TGAAATTTAA
 
Protein sequence
MTGKKIDSID PTDQALPPRS LILFGLQHVL VMAASPITAV FLVSKALGFS DALTVSLISA 
TFLICGLGTI LQSFGPAGFG ARLPFIMVPG GAPIAIFLAI AQQTDIQTAV GAVILTAGFY
FLALPVFRRL LRYFPPIVVG TMLLLVSVNL VRIYGGTITG KQGSEGFADP MNVGLALATI
ALTVIFARIF TGTFQRISVM LGLIAGSMIA FGAGYMDLSG IFDGPVIAVP ALLPFGMPKF
DIFAALPLIV FSIISMAEAT GQTIATAEIV GRRGDAHAIV PATIRGDAVA SLVGGLFGTS
LIITSGENVG IVRATNVKSR YVTAMAGVIL VLIALLAPVG RLANALPGPV VGGTAVIVFS
IIGVIGIDLL RRVDLREHGP MFTLAAALSM GLLPILVPGV YSQFPQWSQM ILANGLAAGT
ITAVIVNAFF QHMPSGSAQK AAAGVEAEI