Gene Smed_5310 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_5310 
Symbol 
ID5319612 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009621 
Strand
Start bp265773 
End bp267776 
Gene Length2004 bp 
Protein Length667 aa 
Translation table11 
GC content62% 
IMG OID640777085 
Productoxidoreductase domain-containing protein 
Protein accessionYP_001314017 
Protein GI150377422 
COG category[C] Energy production and conversion
[R] General function prediction only 
COG ID[COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases)
[COG0673] Predicted dehydrogenases and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.400493 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCTCAG TACAACAGAT CCGTTGGGGT ATCATCGGCC CCGGCACCAT CGCGCGGACC 
TTTGCCGAAG GGATCGCCCA TTCGGCAACA GGGCGCCTCG TCGCCGTCGC GACGCGCAAC
ACGGATAAGC CCGGGCTTGG CGATGCATTT CCGGGGGTGC GGATCGTAAA GGGTTACGAC
AGGCTTCTCG CAGATCCGGA GGTCGATGCC ATTTACGTGG CCACGCCACA TCCGAGCCAT
GCGGAATGGG CGATCAAGGC GACACGCGCC GGAAAGCACG TGCTGGTCGA AAAACCGATG
GCGCTTTCGG CTTATGATGC GAGATCGATC TTTCACGAGG CGCAGAAGGC CGGCGTTTTC
GCCGGAGAAG CCTTTATGTA CAGGTTCCAC CCGCAGACCG GTCGTCTGGT CGACCTGGTG
CGGGAGGGGA CAATCGGCGA GGTGCGGATC ATCCGCTCGA GTTTCGGCTT CAACATGGGC
GCTTATCGCC CGGAGCATCG CCTGTTTGCC AATGATCTCG CCGGCGGCGG CATTCTCGAT
GTCGGCGGTT ATCCGGTCTC GATGGTGTGC ATGCTCGCGG GCGCTGCCGG GAAGAAGCCA
TTCCTTGAGC CGGTGAAAGT CTCTGGCGCT GCCCGTCTGG GACCGTCCGG CGTCGACGAA
TGGGCATCGG CTGTTCTCAA GTTTCCGAAC GACATCGTCG CCGAGGTGTC CTGCTCGATC
TTGGCAGAGC AGGACAACGT GCTGCACATC ATCGGTTCGG AAGGCCGCAT CGAGGTGAAG
GACTTCTGGT TTGCCTCCGG CAGGCGGGGC GGGGTGGGCC GCATTCAAAT CATCAGGGGT
GATCGCCGGC AGGCAATCGA GATCGAGGAA AAGCGCCATC TCTACGCATT CGAGGTCGAT
GCGGTGGGCG AGGCGATCCG CGCCGGCCGC ACCGAGTTCG CCTTCCCCGG CATGAATGCG
GAAGAAACGT TCGCCAATCT GCGGGTTCTC GACCGCTGGC GCGCATCGGT CGGCCTCGAA
TACGAAATCG AGACGGCCGC CAGACGGACG GTCAATATCG CCGGCGAGAA GGTCGTCGCC
GGGGACGGCG TGCCCAGACG GTACATTCCC GGGCTCGTCA AACGAGCCTC CACGGTCGCG
CTCGGCTTCG AATTCTTCCC GAGTTTCGCA GCCGCCTCGC TGACACTCGA TGCCTTCTAC
GAGGCCGGCG GGAATCTGTT CGACACCGCA TTCGTCTACG GTGCGGGCAA GACCGAAAAG
ATTTTCGGCG ACTGGCACAC CAGCCGAGAG GTCAGACGGG ACGATATCGT CCTCATAGGC
AAGGGCGCCC ATTCGCCGCT CTGCTATCCG GATGTGATTG CGAAACAATT GGATCAGTCG
CTGGAGCGGC TGAAGACAGA TTATGTCGAC GTCTACTTCA TGCACCGCGA CAATGCGGAT
GTTCCCGTCG GCGAATTCGT CGATGCGATG GATGACGAGG TCCGGCGCGG ACGTATCCGC
GGTATTTTCG GCGGATCCAA TTGGACGCGG GAGCGCATGG ACGAGGCGAT CGCCTATGCC
GAGAGAAACG GCAAGACGTC GCCAGCCGCT CTTTCAAACA ACTTCTCGCT GGCGCAGATG
CTCGATCCGA TCTGGCCCGG TTGCGTTGCG GCATCGGACG ACAGCTGGAA AGACTGGATG
ACGACGCGCC AGATTCCGAA CTTCGCCTGG TCGAGCCAGG GGAGACGCTT TTTCACGGAG
CGGGCAGGGC GGGACAGGCA TGAGGATGAG GAACTCGTCC GCGTCTGGTA TTCCGATCTG
AACTTCAGGC GCCGGGACCG CGCAATCGAA CTCGCCAAAA AGCGCGGCTG CAGCCCGATC
CACATCGCGC TCGCCTATGT CATCGCACAA CCCTTCCCTG TCATACCGCT CATCGGACCG
CGGACGGTGG CGGAACTGGA AGACAGCCTC TCGGCCATTC ATATCGCTCT TAGTCAGGAT
GAGGTGCGTT GGCTGGACGG CTGA
 
Protein sequence
MSSVQQIRWG IIGPGTIART FAEGIAHSAT GRLVAVATRN TDKPGLGDAF PGVRIVKGYD 
RLLADPEVDA IYVATPHPSH AEWAIKATRA GKHVLVEKPM ALSAYDARSI FHEAQKAGVF
AGEAFMYRFH PQTGRLVDLV REGTIGEVRI IRSSFGFNMG AYRPEHRLFA NDLAGGGILD
VGGYPVSMVC MLAGAAGKKP FLEPVKVSGA ARLGPSGVDE WASAVLKFPN DIVAEVSCSI
LAEQDNVLHI IGSEGRIEVK DFWFASGRRG GVGRIQIIRG DRRQAIEIEE KRHLYAFEVD
AVGEAIRAGR TEFAFPGMNA EETFANLRVL DRWRASVGLE YEIETAARRT VNIAGEKVVA
GDGVPRRYIP GLVKRASTVA LGFEFFPSFA AASLTLDAFY EAGGNLFDTA FVYGAGKTEK
IFGDWHTSRE VRRDDIVLIG KGAHSPLCYP DVIAKQLDQS LERLKTDYVD VYFMHRDNAD
VPVGEFVDAM DDEVRRGRIR GIFGGSNWTR ERMDEAIAYA ERNGKTSPAA LSNNFSLAQM
LDPIWPGCVA ASDDSWKDWM TTRQIPNFAW SSQGRRFFTE RAGRDRHEDE ELVRVWYSDL
NFRRRDRAIE LAKKRGCSPI HIALAYVIAQ PFPVIPLIGP RTVAELEDSL SAIHIALSQD
EVRWLDG