Gene Smed_4040 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_4040 
Symbol 
ID5318340 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp501806 
End bp503035 
Gene Length1230 bp 
Protein Length409 aa 
Translation table11 
GC content62% 
IMG OID640775848 
ProductFAD linked oxidase domain-containing protein 
Protein accessionYP_001312781 
Protein GI150376185 
COG category[C] Energy production and conversion 
COG ID[COG0277] FAD/FMN-containing dehydrogenases 
TIGRFAM ID[TIGR01679] FAD-linked oxidoreductase 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.7202 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAAACA CACCCTGGAG TGCAACCTCA AAAAACCTGG ACCAGGATTG CCACAACTGG 
TCGGGCGGTC TGCGCTTTCG TCCAGCGCGT CTGGAGGTGC CGGAGCATGA AGAGGCCGTG
GCAGCGCTCG TGCGCGCGGC CGGCAGGCAG GGCCGAACCA TACGGCCGGT GGGCTCGGCA
CATTCTTCGA GCGAGATTTA CGTCACGGAC GATGTCCTGG TTTCTCTGGC CAATATCTGT
GGTCTGCATG AGCATGACTC TCGATGTCAC CGAGCCGCGG TCGGTGCCGG CTCACAACTG
ACGGAGCTTA GCAAGGAGCT GCAGTCGGCG GGCATGACGC TCTCAAATTT CGGCGACGTC
GCAACCCAGA CCGTCGGGGG CGCGATCGGA ACCGGCACGC ATGGCTCCGG ACGAAACTTC
CCCAACCTTT CGATGATGCT TGTCGGCGGC CGCCTGGTCA CCGCCCGGGG AGAGATCACC
ACCTTCGGCG TCGAAGAGGA CCTGGATTTC GTGCGAGCCT TGCGTGTTTC CTTCGGGACG
CTCGGCATTC TCACCTCAGC CACTCTCCAG CTCGAACCCT TGCACGATCT CCGCCGTCAG
GAATGGTGCC TTGGCTTCGA GCCCTGCATG GAGGCACTCG ATCGGCTTTC CCGGGAAAAT
CGGAACTTCG ACTTCTATTG GTATCCCCGC TCCGATGAGG TGAAGATCCG TTGCCTCAAC
CCGCCGGGCG AGGAAAAGAC TTATGGCGCC TTCGCCCGGC TGGCGAAGGA CGAGACCGGG
CCGCCGCACG AGGTCATTCC GCAGCACAGC GATCTTCCTT ATCGCTTCGA GGAAATGGAA
TATTCCATGC CGGCCGAGGC CGGACCGGAT TGCATGAGAA AGCTGCGCAC GCGCATCAAG
GAAAAATGGC GCCGCTCGGT CGGCTGGCGC GTGCTCTACC GTTACATCAA GCGTGACGAC
ACCTGGCTGA GCGAAGCCTA TGGCCGGGAC TCTGTGAGCA TATCGCTCCA TCAGAACGCG
ACGCTGCCCT ATTGGGACTT CTTTCTCGAC CTCGAACCGG TGATGCGGGA CCATGGCGGC
CGGCCGCACT GGGCGAAAAA GCACAGTCTT CGCGCGACCG AACTCAAGGC CCTCTACCCG
ATGTGGGATC GTTTTCTTGC CCTTCGGCAG GAGCTGGACC CGGAGGGGCG GTTTCTAACG
CCTTATTTGC GCAGACTTCT CGGGTGCTAG
 
Protein sequence
MANTPWSATS KNLDQDCHNW SGGLRFRPAR LEVPEHEEAV AALVRAAGRQ GRTIRPVGSA 
HSSSEIYVTD DVLVSLANIC GLHEHDSRCH RAAVGAGSQL TELSKELQSA GMTLSNFGDV
ATQTVGGAIG TGTHGSGRNF PNLSMMLVGG RLVTARGEIT TFGVEEDLDF VRALRVSFGT
LGILTSATLQ LEPLHDLRRQ EWCLGFEPCM EALDRLSREN RNFDFYWYPR SDEVKIRCLN
PPGEEKTYGA FARLAKDETG PPHEVIPQHS DLPYRFEEME YSMPAEAGPD CMRKLRTRIK
EKWRRSVGWR VLYRYIKRDD TWLSEAYGRD SVSISLHQNA TLPYWDFFLD LEPVMRDHGG
RPHWAKKHSL RATELKALYP MWDRFLALRQ ELDPEGRFLT PYLRRLLGC