Gene Rleg2_1454 details

Gene Information

Locus tag: Rleg2_1454
Symbol:
ID: 6980182
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011369
Strand:
Start bp: 1479910
End bp: 1481187
Gene Length: 1278 bp
Protein Length: 425 aa
Translation table: 11
GC content: 65%
IMG OID: 643396175
Product: nitrate ABC transporter, substrate-binding protein
Protein accession: YP_002280974
Protein GI: 209549057
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components
TIGRFAM ID:
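
The coordinates and lengths listed above are mutually consistent, which can be checked with a short Python sketch. It assumes the start and end positions are 1-based and inclusive, and that the reported protein length excludes the terminal stop codon; the record does not state either convention, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1479910, 1481187)
print(length, protein_length(length))  # prints: 1278 425
```

Both values match the Gene Length and Protein Length fields of the record.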


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATGACGA ACACCGCCAG TTCCAGCGGC GGGCTGCCCT CGCCCGCGCA TATCAACAGT 
GAAGGGCCAA AAGTGCTGCG GGCCGGTTTC ATTCCGCTCG TCGACGCATC GGTGCTGATA
GCAGCGGCGG AATTCGGTTT CGCGCAGAAG GAAGGCCTGA CGCTCGATCT CGTCAAGGAC
GTCTCCTGGG CGAATGTGCG CGACCGCCTG GCGTTCCGCC AGTTCGACAT TGCCCATATG
CTGTCGCCGA TGCCGGTCGC CTCCATGCTC GGTCTCGGCT CCAATCCCTC GCCGACGATC
ACGCCGTTTT CCCTTGGGCG CGGCGGCAAT GCAATCACGC TGTCGACGCG GCTCTTCGAC
AGGATGCGGG ATGACGCCGG ATTGCCGGAA ACGGCAAGCG CGCTCGACAA CGCCCATGCT
CTGGCCAAAA CGGTCGCGGC GATGAAGGCG CGCGGCGAGC CGCTGCCGAC CTTCGGGGTC
ACCTATCCTT TCTCCTCGCA CAATTACGAA TTCCGCTACT GGCTGGCGGC CGGCGGCATT
AATCCCGACA AGGACGTCAA GCTCGTCGTC GTGCCGCCGC CGCTGACCTC GGATGCCCTG
GCGGCCGGGG CGATCGACGG CTTCTGCGTC GGCGCGCCTT GGAACATGGT CGCCTCGGAG
CGCGGTGTCG GGCGCATCGT CGCCGCCAAA CAGGATATCT GGCCGTCGGC CCCGGAAAAG
GTGATCGGCA TGCGTCCGGA TTGGGCGGAA AGCCATCCGG AAACCGTTTC CCGGCTGATC
GTGGCGCTCG ACGCGGCAGC CCGCTGGTGC GACCGGCCGG ACAATCACGA CGCGCTGGCG
ACAGCGCTTG CCGATCCGCG TTATATCGCC GCCCCCGTCG GCATCATTCG CCGCGTCCTC
GCCGGCGAAT TCAGCCTCGA CGCTAGGGGC AACCGGCGCA TCATCGCCGA TTATTTCCAG
TTTCATTCCG GTTTCGCCAA TTATCCGAGG CCCAGCCATG CGCTTTGGAT CTACAGCCAG
ATGATCCGCT GGGGACAGGC CGAGCTCAGC CTCAACAAGG CGCGGGCGGC CGCCTCCGCC
TACCGTCCCG ATCTCTATCG CAGAGCGCTC GGCGACGCCA ATGCACCTGA TGACGCCGAT
ATCCGCATCG AAGGCAATGA CGAAGGCGAT CGCTTCATGG ACGGCCACGT CTTCGATCCG
ACGGAACTGC CGGACTATGT CGCGGGCTTT GCTGTCAAAA GCGCCCTGCC CGTCGCTTCC
CATAGCGATG AGACCTGA
 
Protein sequence
MMTNTASSSG GLPSPAHINS EGPKVLRAGF IPLVDASVLI AAAEFGFAQK EGLTLDLVKD 
VSWANVRDRL AFRQFDIAHM LSPMPVASML GLGSNPSPTI TPFSLGRGGN AITLSTRLFD
RMRDDAGLPE TASALDNAHA LAKTVAAMKA RGEPLPTFGV TYPFSSHNYE FRYWLAAGGI
NPDKDVKLVV VPPPLTSDAL AAGAIDGFCV GAPWNMVASE RGVGRIVAAK QDIWPSAPEK
VIGMRPDWAE SHPETVSRLI VALDAAARWC DRPDNHDALA TALADPRYIA APVGIIRRVL
AGEFSLDARG NRRIIADYFQ FHSGFANYPR PSHALWIYSQ MIRWGQAELS LNKARAAASA
YRPDLYRRAL GDANAPDDAD IRIEGNDEGD RFMDGHVFDP TELPDYVAGF AVKSALPVAS
HSDET
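
As a rough cross-check between the two sequences above, the following Python sketch strips the 10-character block spacing, translates a DNA string codon by codon, and computes GC content. It relies on the fact that translation table 11 assigns amino acids to codons the same way the standard code does (the tables differ in permitted start codons). The helper names `clean`, `translate`, and `gc_percent` are hypothetical and not part of any database tooling; only the first and last rows of the gene sequence are used here to keep the example short.

```python
# Compact codon table in T, C, A, G order (same layout NCBI uses for its code tables).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


first_row = "ATGATGACGA ACACCGCCAG TTCCAGCGGC GGGCTGCCCT CGCCCGCGCA TATCAACAGT"
last_row = "CATAGCGATG AGACCTGA"  # starts in frame: 21 full rows of 60 nt precede it

print(translate(first_row))  # MMTNTASSSGGLPSPAHINS
print(translate(last_row))   # HSDET
```

The two translated fragments agree with the start and end of the protein sequence in the record. Passing the full gene sequence to `gc_percent` would give a value to compare against the reported GC content.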