Gene Rleg_3298 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_3298 
Symbol 
ID8014183 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp3299127 
End bp3300098 
Gene Length972 bp 
Protein Length323 aa 
Translation table11 
GC content60% 
IMG OID644825857 
ProductNMT1/THI5 like domain protein 
Protein accessionYP_002977084 
Protein GI241205988 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.141874 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGAAAGT TGATGGTTGC AATGATGGCG AGCGCGATGT CGCTTGCATC GGCCCATGCG 
ATGGCCGCCG ACAAGGTGGT GCTGCAGCTG AAATGGGTCA CGCAGAGCCA GTTCGGCGGT
TATTACGTCG CCAAGGAAAA GGGCTTCTAT AAGGAGGAAG GCCTCGACGT CGACATCAAG
CCGGGCGGCC CTGATATCGC CCCCGAGCAG GTGATCGCCG GCGGCGGCGC CGATGTCATC
GTAGACTGGA TGGGCGGTGC CCTGGTTGCC CGCGAAAAGG GCGTTCCGCT CGTCAACATC
GCCCAGCCCT ATCAGAAGGC GGGCCTGGAA ATGGTCTGCC GCAAGGACGG CCCGATCAAG
ACCGAAGCCG ACTTCAAGGG CCACACGCTC GGCGTCTGGT TCTTCGGCAA CGAGTATCCC
TTCTTCGCCT GGATGAACAA GCTCGGCCTG TCCACAGAAG GCGGTCCGAA CGGCGTCACC
GTGTTGAAGC AGAGCTTCGA TGTGCAGCCG CTTGTCCAGA AGCAGGCCGA CTGCATCTCT
GTCATGACCT ATAACGAATA TTGGCAGGCG ATCGATGCCG GCTTCAAGCC GGAAGAACTG
ACGGTCTTCA ACTACACGGA AATGGGCAAC GACCTTCTTG AAGACGGCCT CTATGCGATG
GAAGACAAGC TGAAAGATCC GGCCTTCAAG GAGAAGATGG TCAAGTTCGT CCGCGCATCG
ATGAAGGGCT GGAAATATGC CACCGAGAAT CCCGACGAAG CCGCCGAGAT CGTCATGGAT
AATGGCGGCC AGGACGACAA CCATCAGAAG CGCATGATGG GCGAAGTCGC CAAGCTGGTC
GGCGACAGCT CCGGCAAGCT GGACGAGGCG CTCTATGCCC GCACGGCAAA GGCGCTGCTC
GACCAGAAGA TCATAAGCAA GGAGCCGTCG GGCGCCTGGA CGCACGATAT CACCGACGCC
GCTTCCAAGT AG
 
Protein sequence
MRKLMVAMMA SAMSLASAHA MAADKVVLQL KWVTQSQFGG YYVAKEKGFY KEEGLDVDIK 
PGGPDIAPEQ VIAGGGADVI VDWMGGALVA REKGVPLVNI AQPYQKAGLE MVCRKDGPIK
TEADFKGHTL GVWFFGNEYP FFAWMNKLGL STEGGPNGVT VLKQSFDVQP LVQKQADCIS
VMTYNEYWQA IDAGFKPEEL TVFNYTEMGN DLLEDGLYAM EDKLKDPAFK EKMVKFVRAS
MKGWKYATEN PDEAAEIVMD NGGQDDNHQK RMMGEVAKLV GDSSGKLDEA LYARTAKALL
DQKIISKEPS GAWTHDITDA ASK