Gene Rleg_4754 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_4754 
Symbol 
ID8007007 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012848 
Strand
Start bp122779 
End bp124641 
Gene Length1863 bp 
Protein Length620 aa 
Translation table11 
GC content63% 
IMG OID644821684 
Producthypothetical protein 
Protein accessionYP_002972944 
Protein GI241113109 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00204432 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.330996 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGTCG ATGCGTTCCA GCTCTACGGC ACCCGCCTCG TTGAAACGCC GCCGGTTCGG 
CTGAGAGCCG GAAAACTGGA AGCCGATCTC GCCAATGGCA ACCTCCGCAC CATCCGCTAC
GATGGGACCG AGGTGCTGCG AGCGATCTCC TACCTCGTTC GCGACCCGGA CTGGGGCACC
TACAGCCCTG TAATTGTTGA TCTCCGCATC GAGCAGAGTG ACAATCGTTT CGCGGTCGCC
TATCGAGCCC GCTGCGAGGG ACCTGATGAC ACGAGGCTTG TCATTGACGT TCGCATCACC
GGAAGCGCGG ACCGGCTCGA CTTCGAGGCC GAAGCCATCA CAGAGACCGG CTTCGAGACC
AATCGCTGCG GCTTCTGCAT CCTGCATCCG ATCGTCGGCG TGGCGGGTTC ACCGGCGACG
GTCGAACATG TCGACGGCCG GAAAGTGGCA ACCCGGTTTC CCGATGTCAT CGAGCCCTGG
CAGCCTTTCA AGGACATGCG CGCCATCACT CATGCGATCA TGCCTGACGT TCAGGCGGAA
TGCCGGATGG AGGGCGACAC CTTTGAAATG GAAGACCAAA GGAACTGGTC GGACGCATCC
TATAAGGCAT ATGTCAGGCC GCTCGCCCTG CCCTGGCCAT ACCAGATTGC CGCCAATCAG
CCCGTTCGGC AAAAGACGTC GCTTGTTATC AGGGATATCG GCGGTTCGAC ACGGCATCCT
CCAGCTGCGG CGTCAGGCGG CGCCATAAAA CTCGAACTCG GGGCGCGAAC CGGCACCATG
CCTGATATCG GCGTGATCGT TACGCCCGAG GAAGCCGATG CGACACTGTC GGCAAAGTCC
GTGCTGTCGG AAATCGCTCC CCAGGAACTG CTCTTCCATT TCGACCCCAG TGCAGGACAC
GGCGTCGACG CGCTCACGCA GTTCGCCATG CTCGCCGCGG CCCATCTCGG CCGCTCGACG
CTGGAGATCG CCCTTCCCTG CACATCGTCG CCGTCAAGCG AGGTGGCCGA AATCGCCCAC
CAGATGCGGC TGGCGGAATT CAGGCCGGAT GCGATCATGA TCTCGCCTTC GGTTGACCGG
CAGTCGACGC CGCCCGGCAG CACATGGCCG GAATGCCCGC CTTTGGATGA AGTCTATACC
GCCGCTCGCG CCGCCTTTCC CGGCATTCGC ATCGGCGGGG GTATGCTGAG CTATTTCACC
GAGCTCAACC GGAAGCGCGT CCCGGATGGA CAGCTCGACT TCGTCAGCCA CTGCACCAAT
CCGATCGTGC ATGCCGCCGA CGATCTTAGC GTCATGCAGA CATTGGAAGC GCTGCCCTTC
ATCACACGGT CGGTGCGTGC GATCTACGGT GACAGACCCT ACCGGATCGG CCCGTCGACG
ATCCCGATGC GACAGAATCC CTATGGCAGC CGCACGATGG ATAATCCGTC GGGCGCACGC
GTTCCCATGG CCAACCGCGA CCCGCGTCAC AATGGACGCT TCGCGGAGGC CTTCGCGCTC
GGCTACGCGA TACGGGTACT GGATGCCGGT CTGGAATGCC TGACGCTCTC GGCCTTGTCA
GGCCCGTTCG GTCTGATCGC CGGTCCAGCC GAACCGACCG AGCAAGGCGG GCGGCGCCCG
CTGTTCAACA CAGTGCGGAC ATTGTCTCGA TTGGCTGGCG CATCCTGGCA GGCATGCGTC
TCCTCCTCGC CCTCCGAGGT GCTGTCTTTC GTTGCACGCG ATGCCGCAGG CGCCAGGCTT
CACGTCGTCA ATCTGACGGG CGAAGAACGA AAGGTCGATT GCGACGCCTG CCGGCCGGCA
GATTCGGGCA AAGAGTTTCT GCTCGCGCCG TTTGCGACCG TCGTCCTGCC GCTGGCGGAT
TGA
 
Protein sequence
MKVDAFQLYG TRLVETPPVR LRAGKLEADL ANGNLRTIRY DGTEVLRAIS YLVRDPDWGT 
YSPVIVDLRI EQSDNRFAVA YRARCEGPDD TRLVIDVRIT GSADRLDFEA EAITETGFET
NRCGFCILHP IVGVAGSPAT VEHVDGRKVA TRFPDVIEPW QPFKDMRAIT HAIMPDVQAE
CRMEGDTFEM EDQRNWSDAS YKAYVRPLAL PWPYQIAANQ PVRQKTSLVI RDIGGSTRHP
PAAASGGAIK LELGARTGTM PDIGVIVTPE EADATLSAKS VLSEIAPQEL LFHFDPSAGH
GVDALTQFAM LAAAHLGRST LEIALPCTSS PSSEVAEIAH QMRLAEFRPD AIMISPSVDR
QSTPPGSTWP ECPPLDEVYT AARAAFPGIR IGGGMLSYFT ELNRKRVPDG QLDFVSHCTN
PIVHAADDLS VMQTLEALPF ITRSVRAIYG DRPYRIGPST IPMRQNPYGS RTMDNPSGAR
VPMANRDPRH NGRFAEAFAL GYAIRVLDAG LECLTLSALS GPFGLIAGPA EPTEQGGRRP
LFNTVRTLSR LAGASWQACV SSSPSEVLSF VARDAAGARL HVVNLTGEER KVDCDACRPA
DSGKEFLLAP FATVVLPLAD