Gene Rleg2_3922 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_3922 
Symbol 
ID6982686 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp4070735 
End bp4071904 
Gene Length1170 bp 
Protein Length389 aa 
Translation table11 
GC content63% 
IMG OID643398645 
Productflagellin domain protein 
Protein accessionYP_002283410 
Protein GI209551493 
COG category[I] Lipid transport and metabolism 
COG ID[COG0365] Acyl-coenzyme A synthetases/AMP-(fatty) acid ligases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.172476 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.265134 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGAACG GCTATCATCA GGCTTATGCT GCCTGGAAAC GGGATCCCGA AGCCTTCTGG 
CGCGAAGCCG CCGCCGACAT CGACTGGTTT AAACCGCCGG CGCGGGTGTT TTCGCCCGAG
GAAGGCGTCT ATGGCCGCTG GTTCTCAGGG GCTGAAACCA ATACCTGCCA CAATTGCCTC
GACCGGCATG TGACTGCCGG GCGTGGTGGC GAGATGGCGG TTATTTTCGA CAGTGCGATG
ACCGGCGAGA AGCGCCGTTT CACCTACGAC GAAGTCCTTG ACGAAGTGAA GGCCATCGCC
GCGACGCTTG TTGATCTCGG GATCGGTCGG GGCGATCGCG TCATCCTCTA TATGCCGATG
GTGCCGCAGG CGGTGTTTTC GATGCTCGCC TGCGCCCGCA TCGGTGCGGT TCACTCCGTC
GTCTTCGGTG GTTTTGCCGC CAGCGAGCTT GCTGCCCGCA TCGATGATTG CGGTGCGAAG
CTGGTGATCA CCGCGAGCTG CGGGCTCGAG CCCGGCCGCA TCGTTGCCTA TAAGCCCCTG
GTCGACCAGG CGCTCACGTT GGCGCGTTCG AAGCCGGAGC GCTGTCTGGT GCTGCAGCGG
CCGGAGCTTC GGGCGGATCT CGTCAGCGGC CGCGATCAGG ATTTCGAGGC GGCGGTGGCG
CAGCATCGCG GCGCCGAGAT CGCCTGTGTT CCGGTCAAGG CGACCGACCC GCTCTATATC
CTTTACACCT CGGGAACCAC CGGCCAGCCG AAGGGCGTCG TGCGCGACAC CGGCGCGATC
GAGACGAGCA GCGGGATCAT CGGCACGGCC TTCAACGGAA CCTATGGCGG CACCTTCATC
GTTATGGCGT CGATCTATGA TCTCGATATC ACCGGTTTCA CCCAGGGCCA GCTCGATTCA
GCCCTGACCG GCGTCGAACT GGTTTTGGGT GCCATGACCG CCGCCGGCTC GGCTCTCGGC
TCGATCTCGA CCCGTATCCA GCTGCAGGAA AATTTCGTCA GCGGTCTTCA CGATTCGATC
GACTCCGGCG TCGGCCGCCT GGTCGATGCC GATATGGAAG AGGAATCGAG CAAGCTGTCG
GCGCTGCAGA CGCAGCAGCA GCTCGCCGTC CAGTCGCTGT CGATCGCCAA CAGCTCGGCG
CAGAACATCC TCACCCTGTT CCGCAGCTAA
 
Protein sequence
MQNGYHQAYA AWKRDPEAFW REAAADIDWF KPPARVFSPE EGVYGRWFSG AETNTCHNCL 
DRHVTAGRGG EMAVIFDSAM TGEKRRFTYD EVLDEVKAIA ATLVDLGIGR GDRVILYMPM
VPQAVFSMLA CARIGAVHSV VFGGFAASEL AARIDDCGAK LVITASCGLE PGRIVAYKPL
VDQALTLARS KPERCLVLQR PELRADLVSG RDQDFEAAVA QHRGAEIACV PVKATDPLYI
LYTSGTTGQP KGVVRDTGAI ETSSGIIGTA FNGTYGGTFI VMASIYDLDI TGFTQGQLDS
ALTGVELVLG AMTAAGSALG SISTRIQLQE NFVSGLHDSI DSGVGRLVDA DMEEESSKLS
ALQTQQQLAV QSLSIANSSA QNILTLFRS