Gene Rleg_2143 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_2143 
Symbol 
ID8013161 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp2132754 
End bp2134496 
Gene Length1743 bp 
Protein Length580 aa 
Translation table11 
GC content60% 
IMG OID644824729 
Productprotein of unknown function DUF521 
Protein accessionYP_002975959 
Protein GI241204863 
COG category[S] Function unknown 
COG ID[COG1679] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.180277 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.183216 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGGACA TACAACAACA GGCAGACGAA AAGGTGTTTG CCGGGCGCGC GCTTGTCGCT 
GGCTTCGCTA CGGGGTCGAT CGTATTCAGC GACACGGCGC TCAGCTTCTG GGGCGGCGTC
GACTCCCAGA CGGGAGAAGT CATCGACCGT CACCATCCCC TGTCGACACA GGTCCTGACA
GGAAAGATTC TCGCCATCCC CGGCGGACGC GGTTCGTGTA CGGGCAGCAG CGTGCTTATG
GAGCTGATCA TGAACGGGCA CGCTCCCGCC GGCATCGTGG TCTCGCGCCA GGAGGAAATA
CTGTCGCTCG GAGTGATCGT CGCTGACGAA GTGTTCGGCC GATCGATCCC CGTTGTCCAG
CTCTCCGAAG ACGACTTCGC CGAGCTGCAC TCTATCCCTG AAGTCACCCT CATCGGCGAC
AAGGTCATCG CGAGCTGGAT CGAAACCGCC CCCGGTTTCG ATGCCGCCGA TCGCACCTAT
GGAAATTCGA TAGCGCTGAC ATCGAGGGAT CGGTCGGCTT TGAACGGGGA GATGGGCAAG
GCTGTGCAGG TGGCAATGAG GGCCACGACG CGCATGGCTG AAATTCAGGG AGCGACGGAA
CTTATCGACA TCTCCCAGGT CCACATCGAT GGCTGCATCT ACACCGGGCC GACCAGCCTG
GAATTCGCGA AACGCATGCG GGACTGGGAG GGTAGGGTTG TCGTTCCAAC GACCCTGAAC
TCGATATCGG TGGACCAGAT GCGTTGGCGG GAGCAGGGCG TGTCTCCCGG CGTCGCCGGA
CCCGCATCCG AATTGGGTGA GGCTTACGCC TCCATGGGAG CACGAAAGAC GTTCACCTGC
GCGCCCTATC AGCTGTCGTC CGCACCCAGG CAGGGTGAGC AGGTGGCCTG GGCGGAGTCG
AACGCCGTCG TATTCGCGAA CAGTGTTTTG GGTGCTCGCA CTGCCAAGTA TCCTGACTAC
CTGGATCTGT GCATCGCTCT CACCGGGCGC GCGCCGCTGA CCGGGCCGCA TATCGCCGAC
AACAGGCGCG CCAGCCTCGT GGTCAATGTG TCGGGCTTCG TGTCTTGGGA TGACATGGTC
TACCCCATTC TCGGCTACCA CATCGGCAAG CTTGTCGGAG ATGAAATTCC GGTCGTGATC
GGCCTCGAGA CTTGGAAGCC GAATCTGGAC GACCTAAAGG CGTTCGGGGC GGGTTTTGCG
ACGACATCAG GCTCTCCGAT GTTCCATATC GTCGGTGTCA CGCCGGAGGC CGACAGCCTT
GAGAGCATTG TCGGAAGCAA CATCAAGGCG AGCTACGAAA TCTGTCCCAA GGACGTCGTT
GCGGAGTGGA GGAAACTCAA CGGCGGGTCT GTGGATGCCA TCGAGTTCGT TGCGCTGGGC
AATCCCCATT TCTCTTTCGA CGAATGCGAG CGGCTGGCCG CGCTCTGCGA AGGCTTGGCG
AAACATCCCG ACGTCAAGGT TCTGGTTACC TGCAACCGTG CGACGTTCGA AAGGGCTTCC
GCTGCGAATC TCGTCGGCAA ATTGTCGGAC TTCGGCGTAG AATTCGTAAC CGATGCGTGC
TGGTGCACAC TTGCGGAGCC TGTCATCCCA AAATCGGTCG ATACAATTAT CACCAATTCC
GCGAAGTTCG CTCACTACGG TCCCGGATTG ACGGGTAAGG CGCTTCGGTT TGGCAGCCTG
GCGGACTGCG TCGCTGCTGC TTGTCAGGGC GAGTTGAAGC CTGTTCGTCC CACATGGGAC
TGA
 
Protein sequence
MPDIQQQADE KVFAGRALVA GFATGSIVFS DTALSFWGGV DSQTGEVIDR HHPLSTQVLT 
GKILAIPGGR GSCTGSSVLM ELIMNGHAPA GIVVSRQEEI LSLGVIVADE VFGRSIPVVQ
LSEDDFAELH SIPEVTLIGD KVIASWIETA PGFDAADRTY GNSIALTSRD RSALNGEMGK
AVQVAMRATT RMAEIQGATE LIDISQVHID GCIYTGPTSL EFAKRMRDWE GRVVVPTTLN
SISVDQMRWR EQGVSPGVAG PASELGEAYA SMGARKTFTC APYQLSSAPR QGEQVAWAES
NAVVFANSVL GARTAKYPDY LDLCIALTGR APLTGPHIAD NRRASLVVNV SGFVSWDDMV
YPILGYHIGK LVGDEIPVVI GLETWKPNLD DLKAFGAGFA TTSGSPMFHI VGVTPEADSL
ESIVGSNIKA SYEICPKDVV AEWRKLNGGS VDAIEFVALG NPHFSFDECE RLAALCEGLA
KHPDVKVLVT CNRATFERAS AANLVGKLSD FGVEFVTDAC WCTLAEPVIP KSVDTIITNS
AKFAHYGPGL TGKALRFGSL ADCVAAACQG ELKPVRPTWD