Gene Rleg2_5561 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_5561 
Symbol 
ID6978655 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011368 
Strand
Start bp1208971 
End bp1210320 
Gene Length1350 bp 
Protein Length449 aa 
Translation table11 
GC content63% 
IMG OID643394659 
Productputative nitrilotriacetate monooxygenase protein component A 
Protein accessionYP_002279477 
Protein GI209547559 
COG category[C] Energy production and conversion 
COG ID[COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.200364 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTCAGA AACACGTCAC GTTCGGCATC ATGCTGCAGG GTCCCGGCGG TCACATGAAT 
GCCTGGAAAC ATCCAAGCGG ACCGGCTGAT GCCAGCGTCA ATTTCGACTT CTTCGTCAAC
ACGGCGCGCA AGGCGGAGGC GGCCGGTATC GCCTTCGCTT TCGTCGCCGA CGGGCTCTAT
ATCAACGAGC AGTCGATCCC GCATTTCCTC AACCGGTTCG AGCCGATCGC CATTCTCTCG
GCGCTTGCCG CCTCGACTTC GAAAATCGGC CTCGTCGGCA CAGTCTCGAC CTCCTACAGC
GACCCCTTCA CCATCGCGCG CCAGTTCGCT TCGGTCGATC TCATCAGCGG CGGCCGGGCA
GGGTGGAATG CCGTGACCTC GCCGCTCGAA GGCTCGGGGC GCAATTACAG CCGCGAACAC
CCCGAACACG AACTGCGCTA CGAGATCGCC GAGGACTACA TCGATGCGAT CAAAGGCCTC
TGGGATTCCT GGGATGACGA CGCCTTCGTG CGCAATCGCG AAACCGGCGT CTATGCCGAC
AAGACCAAGA TGCACCGCCT CGACCACAAG GGCCGCTTTT TCCGCATCGA AGGGCCGCTC
AACATCGGCC GTTCGAAGCA GGGGCAACCG GTGGTCTTCC AGGCCGGCGC TTCGGACTCC
GGCATCAGGC TTGCCGGCAA ACATGCCGAT GCCGTCTTTA CCAATGGCGG ACCGTTCGAG
GAGGCGCAGG CCTTCTATCG GCAGCTGATG GATAGCGTCA TCGCTCATGG ACGGCCCGCG
GCGGAAGTCG GCATCTATCC CGGCATCGGC CCGATCGTCG GCAAGACGGC CGAGGAAGCG
GAAGCCAAAT ATCAGGCGAT CCGCAATCTC GTCACCATCG ACGAGGCGCT CCTCTATCTC
GGCCGCTTCT TCGATCACCT TGATTTCAGC GTCTACCCGC TCGATGAGGC TTTCCCGGAT
CTCGGCGATA TCGGCAAGAA CAGCTTCCGC GCGACCACCG ACCGCATCAA GAGGACAGCG
CGCGAAAAAG GCCTGACACT GCGCGAAATC GCGCTCGATG TCGCCACGCC ACGCACCGCC
TTCATCGGCA CGGCGGAGCA TATCGCCGAC GAGATCATTC GCTGGGTGGA CAACGGCGCC
GCCGACGGCT TCATCCTCGG TTTCCCCGTC ATCGCCGAGG GCTTCGACGA TTTTGCCGAA
CACGTCCTGC CGGTCCTGAC CGAGCGGGGG TATTTCGATC CCGTCCTGAA GGGCGAGACG
CTGCGCGACC ACCTCGGCCT GCCCTTCCGC GAAAGCCGGT ATGCGGCCAG TGCCGATCAG
CTCGAGCCCG GAAAGGCTGT CGGCGCCTGA
 
Protein sequence
MAQKHVTFGI MLQGPGGHMN AWKHPSGPAD ASVNFDFFVN TARKAEAAGI AFAFVADGLY 
INEQSIPHFL NRFEPIAILS ALAASTSKIG LVGTVSTSYS DPFTIARQFA SVDLISGGRA
GWNAVTSPLE GSGRNYSREH PEHELRYEIA EDYIDAIKGL WDSWDDDAFV RNRETGVYAD
KTKMHRLDHK GRFFRIEGPL NIGRSKQGQP VVFQAGASDS GIRLAGKHAD AVFTNGGPFE
EAQAFYRQLM DSVIAHGRPA AEVGIYPGIG PIVGKTAEEA EAKYQAIRNL VTIDEALLYL
GRFFDHLDFS VYPLDEAFPD LGDIGKNSFR ATTDRIKRTA REKGLTLREI ALDVATPRTA
FIGTAEHIAD EIIRWVDNGA ADGFILGFPV IAEGFDDFAE HVLPVLTERG YFDPVLKGET
LRDHLGLPFR ESRYAASADQ LEPGKAVGA