Gene Rleg_6283 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_6283 
Symbol 
ID8016154 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012852 
Strand
Start bp347467 
End bp348612 
Gene Length1146 bp 
Protein Length381 aa 
Translation table11 
GC content60% 
IMG OID644827586 
ProductNMT1/THI5 like domain protein 
Protein accessionYP_002978786 
Protein GI241258902 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.101312 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTGGTC CGCAGGTGGC CCGATCGGTG ACATCGGCCC GCAATAGCGA GCGCAATCGG 
TCGGTCGTCC AACAATCCAA CAAGGGGAAA GGCAAGCAAA TGAACGGGGA AGACAAACAA
ATCGTGGTCG GCAGGCGTAC CGTCCTGAAG GGCGGAGCCT TTGCCCTTGC TGCAGCGACG
GCAGGGATCA GCGTGTTCGT GCCGCGTCAC TCCAAAGCCG CCGCATCCAA GGTCGTCATC
AAATATGACT GGCTGATGAG CAACGGACAG ATCGGCGATA TCGTCGCAGT CAAGCGTGGG
CTGTTCGAGG CCGAGGGTCT CGACGTCGAG TTTTCCCCTG GTGGTCCCAA TTCGGCAACG
GTGCCGCCCG TGATCACGGG TGATGCGCAG CTCGGCCAGT TCTCGGATTC GGCACAGCTT
CTTCTTGCCA GGTCATCCGG CGTGCCGATC AAGATCTTCG CCTGCGGTTT CCGCATGGCG
CCTTTCGCCT TCTATTCGCT GCCCAAGGCG CCGATCCGCA CCGTCAAGGA CATGATCGGC
AAGCGCATCG GCATCCAGCC GACGGCTCGT TATGTCCTTG ATGCCATCCT GCTGAAGAAC
AATATCGATC CCTCGAGCCT GACCATCACC AATATCGGCT TCGACATGAC GCCGCTGATG
ACCGGTCAGG TCGATGCAGT GACCGGATGG ATCACTAACA CGCAAGCCCT TTCCATCATC
GGCCCCGACC GCATTGATCT GATAATGAAG GACACGGGCC TGCCGTCCTA CGCCAACGTC
TATTTTGCCA CCGACGATGC CGTGACCGGC CATGCTGAGA CATTGGCAAA GGTGTTGCGT
GCGGTCGCCA AGGGTTGGGC CTGGACGCAT GACCATCCCG AAGAGGCGGT CAAATTGACG
GTGGAGGCCT ATCCGCAGCT CGACCTTGCC GTGGAGCTGA AGACGATACC GCGCATATTG
TCGCTGAGCT TCGACGCAGC AACGGGTAAG GATGGCTGGG GCAGTTTCGA TCCGGCGGCG
CTTGCCGAAC AGATTTCCGT CTACGACAAG ATCGGCCAGT TCAAGAGCGG CGCGCCGAAG
CTGGAGGACT GCTATACGGC CAAAATCCTG GACATGACGG CGGACGACCG CCCGAAGATT
GCGTGA
 
Protein sequence
MRGPQVARSV TSARNSERNR SVVQQSNKGK GKQMNGEDKQ IVVGRRTVLK GGAFALAAAT 
AGISVFVPRH SKAAASKVVI KYDWLMSNGQ IGDIVAVKRG LFEAEGLDVE FSPGGPNSAT
VPPVITGDAQ LGQFSDSAQL LLARSSGVPI KIFACGFRMA PFAFYSLPKA PIRTVKDMIG
KRIGIQPTAR YVLDAILLKN NIDPSSLTIT NIGFDMTPLM TGQVDAVTGW ITNTQALSII
GPDRIDLIMK DTGLPSYANV YFATDDAVTG HAETLAKVLR AVAKGWAWTH DHPEEAVKLT
VEAYPQLDLA VELKTIPRIL SLSFDAATGK DGWGSFDPAA LAEQISVYDK IGQFKSGAPK
LEDCYTAKIL DMTADDRPKI A