Gene Smed_3690 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_3690 
Symbol 
ID5318808 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp130528 
End bp131802 
Gene Length1275 bp 
Protein Length424 aa 
Translation table11 
GC content62% 
IMG OID640775503 
Productmajor facilitator transporter 
Protein accessionYP_001312436 
Protein GI150375840 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG2223] Nitrate/nitrite transporter 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.762235 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGCTC TTCACCAGAC ACAGTCGACC TTAAAGGGCG CATCGGCACG CGCTCTGTGG 
GCCTCGACGG GCGCATTTAC GATCTGTTTC GCAATCTGGA CCATTTTTTC CATCATCGGC
GTGCGTATCA AGGAGGATCT CGGCCTCAGC GAAGCCGAGT TCGGGCTGCT GATCGGCATG
CCGATCCTGA CCGGCTCGCT CGTGCGCATC GTTCTCGGCA TCTGGACAAC CCGGTATGGC
GGCCGGCTCG TCTACACCCT CACCATGCTG GCGGCAGCGC TTGCGACCTT TCTTCTCGGC
TACGCAACGA CCTATCCGCA GATGCTCCTC GCGGGCCTTG GCGTCGGTCT TGCGGGAGGT
TCTTTTGCCG TAGGAGTCGC TTATGTCTCG CCCTTCTTTC CGCCTGAGAA GCAGGGCACG
GCGCTCGGCA TCTTCGGCGC CGGCAATGTC GGCGCGGCAG TGACGAAATT CCTGGCGCCT
TTCGTTCTCG TCGCTTTCGG CTGGCAGGCG GTCGCCGAGA TCTGGGCTGC CGTTCTGGCC
GTCACAGCCA TTATCTTCTG GTTCTCGACC GAAGACGATC CGCAATTTCG CGCCCGCCGC
CAGGGTGCCG TTGCGCGCAA GAGCCTGCTT AGTGAATTCG AGCCTCTCAA GAATGTCCAG
GTCTGGCGCT TTTCGCTCTA TTATTTCTTT GCCTTCGGCG GCTTCGTCGC CCTTGCGCTC
TGGCTGCCGC GCTACCTCGT GGGCGTCTAC GGTTTCAACA TCGAGACGGC TGGCATGATC
GCAGCCGCGT ACTCCATTCC GGGAAGCATC TTTCGGGCCT ATGGAGGCGC CCTGTCGGAC
AAGCTGGGCG CCCGCAAAGT CATGTATGCC ATGTTCGCGG TATCTGCCGT CGCAACGGCC
ATCCTGTCGG TACCCGCCGG CTCGGCGAGC GGTGCGATGC CGATCATGGT GACGCCGCTC
GTCTTCGTCT TCGTCACTTT CGTGCTCGGC TTCGTCATGA GCCTCGGCAA AGCGGCGGTC
TACAAGCATA TACCCGTGTA TTATCCCACC CATGTCGGTG CTGTCGGCGG GGTCGTCGGC
ATGGTGGGCG GCCTCGGCGG CTTCGTCCTG CCGATCGCTT TCGGCTATCT CAAGGATGTA
ACCGGTCTCT GGTCGAGCTG CTTCATGCTG CTCTTCGTCA TCGTAGCCGT CTCCATGATC
TGGATGAAGG TCTCGATCCG GCAGATCAAG CGCGAGGCCG ACAGGGGCGT CCCGTCCGGC
GCGCTCGCGC GGTAG
 
Protein sequence
MSALHQTQST LKGASARALW ASTGAFTICF AIWTIFSIIG VRIKEDLGLS EAEFGLLIGM 
PILTGSLVRI VLGIWTTRYG GRLVYTLTML AAALATFLLG YATTYPQMLL AGLGVGLAGG
SFAVGVAYVS PFFPPEKQGT ALGIFGAGNV GAAVTKFLAP FVLVAFGWQA VAEIWAAVLA
VTAIIFWFST EDDPQFRARR QGAVARKSLL SEFEPLKNVQ VWRFSLYYFF AFGGFVALAL
WLPRYLVGVY GFNIETAGMI AAAYSIPGSI FRAYGGALSD KLGARKVMYA MFAVSAVATA
ILSVPAGSAS GAMPIMVTPL VFVFVTFVLG FVMSLGKAAV YKHIPVYYPT HVGAVGGVVG
MVGGLGGFVL PIAFGYLKDV TGLWSSCFML LFVIVAVSMI WMKVSIRQIK READRGVPSG
ALAR