Gene Rleg2_3233 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_3233 
Symbol 
ID6981985 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp3321397 
End bp3322884 
Gene Length1488 bp 
Protein Length495 aa 
Translation table11 
GC content64% 
IMG OID643397950 
Producthypothetical protein 
Protein accessionYP_002282726 
Protein GI209550809 
COG category[S] Function unknown 
COG ID[COG4222] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.827086 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACAACG TCAGACCATA CGCATCGCGG CTGTTTGCTG CGGTTCTCGC GGCATCCGCC 
GCCATTCCGG CCGCAGCAAG CGCGGAGAAT TCCGCCGCCG TCGGCAGCCT TACCTTCGTC
AACAAGGGCC TGGTCGGCGT CGGCCGCATC CCGGCCAACC AGCGCGACAA ATTCGGCGAA
ACCTTCGGCT CCGGCTCCGG CATGGCGATC GATTCCGCCG CCTGGAGCCG CGATGGCGCC
GGCTACAAGG GCACGCTCTA CCTGTTGCCC GACCGCGGCT ATAACGCCGT CGGCACCGTC
GACTATCGGC CGCGCCTGAA CACCATTGCG ATCGGCCTGA CGCCGACCGC TCCGGGTGCG
GCACCCGAGG CCGGCAAGGA GCAATCCGGG GTCGATGCGA AACTTGTCGA TTCCACCCTG
TTTGTCGATG ACAAGGGCGG CGACATGACC GGCCTCGACC CGGAATCCGG CGTCCGCGCC
GCTGCCGGCG ACTTTCCGCC GCTGCCGCAG GCGGTGAACG GCAAGATCGC CCTCGACAAT
GAAGCCATCA TCCGCATGGC CGACGGCAGC ATGTTCGTCA GCGACGAATA CGGCCCCTAT
ATCTATCGCT TCGCAGCCGA CGGTCACCTG CTCTCGGCCA CCCAGCCGCC GAAGGCGCTG
TTGCCGATGC GCAAGGGCGC GCTGAGCTTC GCCTCCAACA ATCCCGGCCC CGGCGCATCC
GCTCCGGATC CGAAGGACCC GGAGACCGGC CGCCAGAACA ACCAGGGCCT TGAAGGCATG
GCGATGACGC CGGACGGCAG GTTCATCATA GCAGTACTGC AGTCGGCGGC TCGCCAGGAT
GGCGGCGATT CCGGCTCGAC CCGCCAGAAC ACCCGCGCCA TGATCTATGA CGCCGCCGAT
CCCGATCACC TGAAGCTGGT GCACGAATAT GTCGTGCCGC TGCCGGTCTT CAAGGATGCC
AAGGACAAGA CAGTGATCGC GGCAGAGAGC GAAATCGTCG CCCTTTCCGA CAAGAGCTTC
CTGATGCTTG CCCGCGACAG CGGCAACGGT CAGGGCCTGA AGGGCGACAC CTCGCTCTAC
CGCAAGATCA ACATCGTCGA TCTCTCCACT GCGACCGATA TCGCCGGCAG CGATTTCGAT
GCCGGCAAGC CGATCGCGCC GAAGGGCGTC GTCGATCCTT CGCTGACGCC GGCGACGCTG
ACGCCGTTCA TCGACATCAA CGACAAGGCC GAGCTTGCCC GCTTCGGCCT GCACAATGGC
GCACCGAACG ACAAGAACAA TCTGTCGGAA AAATGGGAAG CCATGGGGCT TGCGAGCGTT
CTCGATCCGA ACCTGCCGGA CGACTATTTC CTGTTCGTCG CCAATGACAA CGACTTCCTG
ACGCAGGATG GTTTCCAGGT GGGCGCAGCT TACAAGGCGG AGGGCGGCGC CGACGTCGAC
ACCATGTTCC AGGTCTTCCA GGTCACCCTC CCCGGCTTGA GGAAGTAG
 
Protein sequence
MNNVRPYASR LFAAVLAASA AIPAAASAEN SAAVGSLTFV NKGLVGVGRI PANQRDKFGE 
TFGSGSGMAI DSAAWSRDGA GYKGTLYLLP DRGYNAVGTV DYRPRLNTIA IGLTPTAPGA
APEAGKEQSG VDAKLVDSTL FVDDKGGDMT GLDPESGVRA AAGDFPPLPQ AVNGKIALDN
EAIIRMADGS MFVSDEYGPY IYRFAADGHL LSATQPPKAL LPMRKGALSF ASNNPGPGAS
APDPKDPETG RQNNQGLEGM AMTPDGRFII AVLQSAARQD GGDSGSTRQN TRAMIYDAAD
PDHLKLVHEY VVPLPVFKDA KDKTVIAAES EIVALSDKSF LMLARDSGNG QGLKGDTSLY
RKINIVDLST ATDIAGSDFD AGKPIAPKGV VDPSLTPATL TPFIDINDKA ELARFGLHNG
APNDKNNLSE KWEAMGLASV LDPNLPDDYF LFVANDNDFL TQDGFQVGAA YKAEGGADVD
TMFQVFQVTL PGLRK