Gene Rleg2_4540 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_4540 
Symbol 
ID6977634 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011368 
Strand
Start bp178985 
End bp179968 
Gene Length984 bp 
Protein Length327 aa 
Translation table11 
GC content63% 
IMG OID643393718 
ProductNMT1/THI5 like domain protein 
Protein accessionYP_002278536 
Protein GI209546618 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.184718 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCTTCC ACCCCCGCAA TCTCCTCCTG CCGGCCGTGA TCGCCCTCGG TCTTGCCACC 
CCGGCCGCCG CCGCCGCCAC GGTGAAGCTG CGCTACCTCG CCAGCCAAGG CGGTCTTGCC
GCCCACGAAC TTGCCGACGA ACTCGGCTAT TTCAAGGACA CCGGCATCAC GTTTGAGAAT
GTCGGCTATG CCCAGGGCGG TCCGGCCTCT CTGATCGCGC TCGCATCCGG CGATGTCGAG
ATCGGCAGTG CGGCCACCTC CGCGGTGCTG AATTCGATCA TCGGCGGCAA CGACTTCGTA
GCCGCCTATC CGTCGAACGG CATCAATGAC GAGGTGCAGT CGACTTTCTA CGTGCTGGAA
GACAGCCCGA TCAAAAGCAT CAAGGACATT GTCGGCAAGA GTATCGCGGT CAACACGCTC
GGTGCCCATC TCGACTACAC CATCCGCGAA GCCCTGCATT CTGTCGGCTT GCCGAGCGAC
TCCGCCAACC AGGTCGTCGT TCCCGGGCCG CAGCTCGAGC AGGTGCTGCG CTCCAAGCAG
GTCGATATCG CCGCCTTCGG CTATTGGCAG ACGACCTTCG AGGGCGCGGC GCTCAAGAAC
GGCGGCTTGC GTGCGGTCTT CGACGATACC GATGTGCTCG GCGACATTGC CGGCGGCTTC
GTGGTCCTGC GCCGAGATTT CATTCAGCAG CATCCGCAAG CCGCCAAGAT CTTCGTCGAG
CAGTCGGCCC GCGCCCTCGA TTATGCACGC GAACATCCTG AGGAAACCAA AAAGATCCTC
GCCAAGGCGC TCAGTGAGCG TGGCGAGAAC GCGGATATCG CGCAATATTT CCGCGGCTAC
GGCGTGCGCG CCGGCGGCCT GCCGGTCGAG CGCGATATCC AGTTCTGGAT CGACGTCCTC
GTCCGCGAAG GCAAGCTGAA GCAGGGCCAG CTGGCGGCCA AGAACATTCT CTTTACCGCC
GACGCCAAGC CGGCAAGCAA CTGA
 
Protein sequence
MTFHPRNLLL PAVIALGLAT PAAAAATVKL RYLASQGGLA AHELADELGY FKDTGITFEN 
VGYAQGGPAS LIALASGDVE IGSAATSAVL NSIIGGNDFV AAYPSNGIND EVQSTFYVLE
DSPIKSIKDI VGKSIAVNTL GAHLDYTIRE ALHSVGLPSD SANQVVVPGP QLEQVLRSKQ
VDIAAFGYWQ TTFEGAALKN GGLRAVFDDT DVLGDIAGGF VVLRRDFIQQ HPQAAKIFVE
QSARALDYAR EHPEETKKIL AKALSERGEN ADIAQYFRGY GVRAGGLPVE RDIQFWIDVL
VREGKLKQGQ LAAKNILFTA DAKPASN