Gene Rleg_2019 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_2019 
Symbol 
ID8013052 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp2010631 
End bp2011854 
Gene Length1224 bp 
Protein Length407 aa 
Translation table11 
GC content67% 
IMG OID644824606 
Productmolybdenum cofactor synthesis domain protein 
Protein accessionYP_002975837 
Protein GI241204741 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0303] Molybdopterin biosynthesis enzyme 
TIGRFAM ID[TIGR00177] molybdenum cofactor synthesis domain 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.00400246 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCTTC TCCCGGTCGC CGACGCCCTG AACCGCTTGC TCTCTCGCGC AAAACCCACT 
GCTGCGTCCG AGACGCTGCC GCTAGCCGAA GCCGAGGGCC GCGTCCTGGC CGTCGATCTG
ACGGCCGGCC TGACCCAGCC GCCCTTCAAT GCCTCCGCCA TGGATGGCTA TGCGCTGCGC
CGCGAAGACG CGCCGGAGCC GGGCGCCGAG CTGAAGGTCA TCGGCACGTC TTCCGCTGGC
CACGGCTTCG AGGGAAGCGT CAGCCAGGGT GAAGCCGTCC GTATCTTCAC CGGCGCGCCT
GTTCCGCCGG GCGCCGACAG CGTCCTGCTG CAGGAGGATG CGGAGAAGAT CGAAGGCGGT
ATCCGAACCA ATTTCCCTGT GCGGCAGGGC CAGCATGTGC GTCCGCGCGG TCAGGATTTC
GCCGAAGGCG AAGCCGTTCT GTCGGCCGGC ACCGTGCTCG ATTTCTCGCG GCTGACGGTT
GCTGCCGGCA TGAACCGGCC TGATGTCGAA GTGCTGCGGC GCCCGCTGAT CGCCATCCTC
GCGACCGGCG ACGAACTGCT GCCGCCCGGA AGCACGCCCG GCCCTTCACA GATCATCGCA
TCCAATACGT TTGGTATTGC AGCCCTTGCC CGCAAAGCCG GCGCCGATGT GCTCGATCTC
GGTATCGTGC CTGACGACAA GGCTAGGATC ACCGCGGCGA TCGACACGGC CCGGGATGCC
GGGGTCGATG TGATCGTCAC GCTCGGTGGC GCTTCGGTCG GCGACCATGA TCTGGTGCAG
GCCACGCTGA TCGAAGCCGG CATGCAGCTC GATTTCTGGC GCATCGCCAT GCGCCCCGGC
AAACCGCTGA TGGTCGGCAG CTTCGGCGAG ACGCATGTGC TCGGCCTGCC CGGCAATCCG
GTCTCGAGCC TCGTCTGTTC GCTGCTCTTC CTGGAACCGC TGATCCGCAG GCTTGCCTCC
CTGCCGCCGG TGCGCCGCGA GGCAACGGTG GAGGCGGCTG TCACGCTGCG CGCCAACGAC
CACCGGCAGG ACTATATCAG GGCGAAACTT TCGAAATCCG CCGCCGGCCA CTGGCTCGCC
GAGCCTTTCG GCAAGCAGGA TTCCTCGATG ATGAAGGTCT TCGCCGCAGC CGATTGCCTC
GTCATCCGCC CGCCACATGC GCCGGAGCTG CTGGCCGGAG CGCCCTGCCC GGTCATGCTG
TTGCGGCCGG ACCTTCTGGC CTGA
 
Protein sequence
MNLLPVADAL NRLLSRAKPT AASETLPLAE AEGRVLAVDL TAGLTQPPFN ASAMDGYALR 
REDAPEPGAE LKVIGTSSAG HGFEGSVSQG EAVRIFTGAP VPPGADSVLL QEDAEKIEGG
IRTNFPVRQG QHVRPRGQDF AEGEAVLSAG TVLDFSRLTV AAGMNRPDVE VLRRPLIAIL
ATGDELLPPG STPGPSQIIA SNTFGIAALA RKAGADVLDL GIVPDDKARI TAAIDTARDA
GVDVIVTLGG ASVGDHDLVQ ATLIEAGMQL DFWRIAMRPG KPLMVGSFGE THVLGLPGNP
VSSLVCSLLF LEPLIRRLAS LPPVRREATV EAAVTLRAND HRQDYIRAKL SKSAAGHWLA
EPFGKQDSSM MKVFAAADCL VIRPPHAPEL LAGAPCPVML LRPDLLA