Gene Rleg2_5003 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_5003 
Symbol 
ID6978097 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011368 
Strand
Start bp650457 
End bp651455 
Gene Length999 bp 
Protein Length332 aa 
Translation table11 
GC content57% 
IMG OID643394149 
ProductNMT1/THI5 like domain protein 
Protein accessionYP_002278967 
Protein GI209547049 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGAATA TTTCTCGACG TAACATGCTT GCGCTCATGG GCGGGACGCT TGCCGCGCCG 
TACATCATCA CCTCAAAAGC CAGTGCAGCC GCGTCTGCGG CGGAGCCTGG CATCTTCAAG
ATGTCCATTC AGCCTTGGAT CGGCTACGGC ATGTGGTATG TCGCCCAGGA GAAGGGAATA
TTTGCCGCGA ATGGACTCGA AGGTGTCGAG TTTATACCGT TTACGGAAGA CGCCGCCAAT
CTGGCGATCC TCGCAAGCGG AGAGATCCAA GCCACGAATA GCGCCACGCA CAACGTGCTG
CAGCATCTGC AGGAATCGCC CGACTACCAT ATTGCCCTGA TCGAGGATTA CAGCACGACC
GCCGACGCCA TCGTTAGCGC AAAGACCGTC AATTCCGTCG CGGATCTCAA GGGAAAATCG
ATCGCATTTG AAGAGGGTAC GACCAGCCAC CTCCTCATCG CTGACGCGCT GAAGAAAGCG
GGTATGAAGT TGAGCGATCT CAACTGGACC AAGACACCAG CATCGCAGGC CGCCGCTGCC
CTCTTGTCGG GCTCCACCGA CGCGATGGTT ACTTATGAAC CCTACGTTTC GACAGCCTTG
AAGGCCGATC CGAGCCTCAA GCTCCTCTAT ACAGCGGCCG AGAGCCCCGG CCTCATCAGC
GACTGCTTCG TCGTCACGAC ACGGACGTTG AAGGAGCGGC CCGGACAGGT TCGCGCGATG
GTAAAGAGCT GGGGGGACGC TGTCGACCTT TACAACAAGA ACCCGCAGGA GTGCCAGGCA
ATCATCGCGA AGGGGGTTGG GTCCGATCCA AGCGAACTCG GGTCGACCTT CGAGGGCGTC
CACTACTACA CCCTTGCAGA AAACAAAGCC CTGTTGAGCG GTGAATATGC GGCGAAGACA
TTCCCGGCCG TGAACGCGGC CAGCCTTGAA CTTGGTCTGA TCAAAAAGGC ATTTGAGGCG
AAAGACGTCA TCGACACAAC GGCTCTTTCG TCACTTTAA
 
Protein sequence
MTNISRRNML ALMGGTLAAP YIITSKASAA ASAAEPGIFK MSIQPWIGYG MWYVAQEKGI 
FAANGLEGVE FIPFTEDAAN LAILASGEIQ ATNSATHNVL QHLQESPDYH IALIEDYSTT
ADAIVSAKTV NSVADLKGKS IAFEEGTTSH LLIADALKKA GMKLSDLNWT KTPASQAAAA
LLSGSTDAMV TYEPYVSTAL KADPSLKLLY TAAESPGLIS DCFVVTTRTL KERPGQVRAM
VKSWGDAVDL YNKNPQECQA IIAKGVGSDP SELGSTFEGV HYYTLAENKA LLSGEYAAKT
FPAVNAASLE LGLIKKAFEA KDVIDTTALS SL