Gene Smed_6252 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_6252 
Symbol 
ID5320554 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009621 
Strand
Start bp1172452 
End bp1173492 
Gene Length1041 bp 
Protein Length346 aa 
Translation table11 
GC content59% 
IMG OID640777852 
Productluciferase family protein 
Protein accessionYP_001314784 
Protein GI150378189 
COG category[C] Energy production and conversion 
COG ID[COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.905079 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.361259 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATTCT CACTCTTCGT TCACATGGAG CGCCTGGATG CTTCGCAAGA CCACAGGACG 
CTTTACGAAG AATTCATCAA GCTGTGCGAA GTCGCCGACA AAGGCGGCAT GCACGCGATC
TGGACCGGTG AACATCATGG AATGGAGTTC ACCATTGCGC CGAATCCCTT CATAACGATT
GCCGACCTTG CCCGCCGCAC TAAGACCGTG CGGCTTGGAA CCGGCACGGT GATCGCGCCC
TTCTGGCATC CGATCAAGCT CGCGGGAGAA GCCGCAATGA CGGATCTGAT CTGCGAGGGT
CGCCTCGACA TCGGAATTGC CCGCGGCGCC TATTCCTTCG AGTACGAGCG GCTGCTGCCG
GGCCTCGACG CCTGGAGCGC TGGGCAGCGC ATGCGCGAAC TCATTCCGGC GGTGAAGGGG
ATCTGGGCGG GTGATTACGC CCACGACGGC GAGTTCTTCA AGTTTCCGGC CACGACCTCG
TCACCGAAGC CGCTGCAGAA GCCCCATCCG CCGATATGGG TTGCTGCGCG CGACCCCAAC
TCGCACGAGT TTGCCGTTTC GAACGGCTGC AATGTGCAGG TGACGCCACT CTGGCAGGAC
GACGAGGAGG TTCGGAGCCT GATGGGACGG TTCAACGACG CCTGCGCCAA GGATCCAGAG
GTCCCGCGCC CGAAGATCAT GCTGCTGCGG CACACCTATG TCGGCTCCGA CGAGGCGGAT
ATCGCGCAGG CAGCTCATGA GATGAGCGTA TACTACAATT ACTTCTTCGC CTGGTTCAAG
AACGAAAGAC CGATCAGACA AGGCCTCATT GATCGGATTC CGGACGAGGA AATTGCCGCC
AATGCCATGC TCTCAGGCGA GGCAATGCGA CGCAACAACG TCGTCGGCGC AGCCGACGAG
GTCATCGCCC GCATCAAGAG CTACGAGGCA ATGGGATATG ACGAATATTC CTTCTGGATA
GACACAGGCA TGACCTTCGA GCGCAAGAAG GCTTCGCTCG AACGCTTCAT CGCCGATGTC
ATGCCAGCAT TTGCGGAGTA G
 
Protein sequence
MKFSLFVHME RLDASQDHRT LYEEFIKLCE VADKGGMHAI WTGEHHGMEF TIAPNPFITI 
ADLARRTKTV RLGTGTVIAP FWHPIKLAGE AAMTDLICEG RLDIGIARGA YSFEYERLLP
GLDAWSAGQR MRELIPAVKG IWAGDYAHDG EFFKFPATTS SPKPLQKPHP PIWVAARDPN
SHEFAVSNGC NVQVTPLWQD DEEVRSLMGR FNDACAKDPE VPRPKIMLLR HTYVGSDEAD
IAQAAHEMSV YYNYFFAWFK NERPIRQGLI DRIPDEEIAA NAMLSGEAMR RNNVVGAADE
VIARIKSYEA MGYDEYSFWI DTGMTFERKK ASLERFIADV MPAFAE