Gene Rleg_4919 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_4919 
Symbol 
ID8007515 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012848 
Strand
Start bp298242 
End bp299270 
Gene Length1029 bp 
Protein Length342 aa 
Translation table11 
GC content57% 
IMG OID644821839 
Productnodulation factor exporter subunit NodI 
Protein accessionYP_002973099 
Protein GI241113264 
COG category[V] Defense mechanisms 
COG ID[COG1131] ABC-type multidrug transport system, ATPase component 
TIGRFAM ID[TIGR01288] ATP-binding ABC transporter family nodulation protein NodI 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGGTC AATCGGATCT ACAGGCAGCT GTGGCTGAGA CGCTCTGCCG CGGAAGCAAC 
CTACCGTGCA TTTCGATTTC AGAAGCAATC GCTCCCCAAG TAGGAGCGAT ATCTTCTATT
GCGATCGAGC TCGTCGGCGT TACCAAATCA TATCGTGGGA AGGCCGTGGT TGACGGATTG
TCTTTCAACA TTGGGTCGGG AGAGTGCTTT GGCCTTTTAG GCCCAAACGG CGCCGGAAAA
AGTACGATCA GCCGTATGAT TCTAGGAATG ACGTCGCCCG ATGCGGGCAC TATTTCGGTG
CTCGGAGCGC AGGTACCCCG GCAGGCTCGT TCGGCGCGCG CCCGTATCGG TGTTGTTTCT
CAGTTCGACA ATCTCGACAT GGAGTTCACG GTTAGAGAAA ACCTGATTGT CTACGGCCGG
TACTTCCGCA TGAAAGCCAG GGAGATCGAA GCGATCTTGC CCTCGCTGCT GGAATTTGCG
CGGCTTGAAA ACAAGGCAGA TACAAGGGTG GCGGATCTTT CTGGAGGCAT GAAGCGGCGC
CTGTCATTGG CGCGCGCCCT GATCAATGAC CCGCAAATCC TCATATTAGA TGAACCGACC
ACCGGCCTTG ACCCGCACGC ACGCCACCTG ATCTGGGAGC GTTTGCGATC GCTATTGGCG
CAAGGCAAAA CGATCCTGTT GACGACCCAC ATCATGGAAG AGGCAGAACG CTTATGTGAC
CGGCTGTGCG TGCTCGAAGG TGGAGTTAAG ATAGCCGAAG GCCGCCCCTT TGACCTGATA
AAGGAGCAGA TCGGCTGCCC CGTCATCGAG ATCTATGGCG GGGATCCGCA GGAGCTTAGC
CTCTTGATCA AGCCATACGC ACGGCGCATC GAAATCAGCG GCGAGACCCT GTTCTGCTAC
ACCCCCGACC CAGAACAAGT TCGGGCGCAA CTGCGCGGAC ACTGGGGTCT GCGCCTCCTG
GAGCGGCCGC CAAACCTAGA GGACGTCTTC CTGCGGTTAA CCGGACGCGA GATGGGGAAG
TACCAATGA
 
Protein sequence
MNGQSDLQAA VAETLCRGSN LPCISISEAI APQVGAISSI AIELVGVTKS YRGKAVVDGL 
SFNIGSGECF GLLGPNGAGK STISRMILGM TSPDAGTISV LGAQVPRQAR SARARIGVVS
QFDNLDMEFT VRENLIVYGR YFRMKAREIE AILPSLLEFA RLENKADTRV ADLSGGMKRR
LSLARALIND PQILILDEPT TGLDPHARHL IWERLRSLLA QGKTILLTTH IMEEAERLCD
RLCVLEGGVK IAEGRPFDLI KEQIGCPVIE IYGGDPQELS LLIKPYARRI EISGETLFCY
TPDPEQVRAQ LRGHWGLRLL ERPPNLEDVF LRLTGREMGK YQ