Gene Rleg_5583 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_5583 
Symbol 
ID8016474 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012853 
Strand
Start bp165648 
End bp166736 
Gene Length1089 bp 
Protein Length362 aa 
Translation table11 
GC content64% 
IMG OID644827749 
Producthypothetical protein 
Protein accessionYP_002978949 
Protein GI241518321 
COG category[S] Function unknown 
COG ID[COG4641] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value0.0202038 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGATCG CCTTTTACGG ATCGAGCCTG GTTTCCGCCT ACTGGAACGG CGCCGCCACC 
TACTATCGCG GCCTGCTGCG CGCTTTGGCG CGGAAGGGCT ACGACATCAC CTTCTACGAG
CCAGATGTTT ACGACCGGCA GAAAAACCGC GACATCGATC CCCCGGAATG GTGCAAGGTC
GTCGTCTATC AAGGCACCAT CGACGCGCTG AGGCAGGTGA CGGCGGCCGC GGCCGAGGCC
GATATCGTCG TCAAAGCCAG CGGCGTCGGC TTCGAGGACG ATCTCCTGCT GCAGGAAGTC
CTTCGCCATG CCAGGCAAGG GGCTTTGAAG ATCTTCTGGG ACGTGGATGC GCCGGCAACG
CTTTCCGAGC TGCGGCAGAC CCCCGAGCAC CCGCTTCGCA AGTCCTTGAG CCGGATCGAC
CTCATCCTCA CCTATGGTGG CGGCGACCCC GTGATCGACG CCTATCGCGG CCTCGGGGCA
GCCGACTGCG TGCCGATCTA CAACGCGCTC GATCCTCAAA CCCATCATCC GGTGCAGGAG
GAGGCGCGGT TCACCGCGGA TCTTGCCTTT CTCGGCAACC GTCTGCCCGA CCGTGAAGCG
CGGGTCGAGC AGTTTTTTCT CGAACCCGCG GCCCGCCTGC CGCGGCAAAG CTTTCTGCTC
GGCGGGTCCG GCTGGAGCGA CAAAGCCTTG TCGTCGAACA TCGTTCACAT CGGGCATGTC
CTGACCCGCG ACCACAACGC GTTCAACGCG ACGCCGAAGG CGGTGCTCAA TATTTCCCGT
ACCAGCATGG CCGAAAACGG TTTTTCGCCG GCAACCCGCG TTTTCGAAGC CGCAGGCGCC
GGCGCCTGCC TGATCACCGA CTACTGGCAA GGCATTGACC TGTTTCTGAA GCCCGGCGAA
GAAATCCTGG TGGCGCGCGA CGGCCAGGAT GTCGCCGATC TTTTGACCGG CCTGACATGG
CAGCAGGCCA GGGCGATCGG ACAGCGGGCG CTAAGACGTG TGCTTGCCGA GCATACCTAT
AGCAATCGCG CCGAGACCGC CGATGCCATC TTCCGCGCTC ATGCCGCGCG AGCGGAGGCG
GCCGAATGA
 
Protein sequence
MKIAFYGSSL VSAYWNGAAT YYRGLLRALA RKGYDITFYE PDVYDRQKNR DIDPPEWCKV 
VVYQGTIDAL RQVTAAAAEA DIVVKASGVG FEDDLLLQEV LRHARQGALK IFWDVDAPAT
LSELRQTPEH PLRKSLSRID LILTYGGGDP VIDAYRGLGA ADCVPIYNAL DPQTHHPVQE
EARFTADLAF LGNRLPDREA RVEQFFLEPA ARLPRQSFLL GGSGWSDKAL SSNIVHIGHV
LTRDHNAFNA TPKAVLNISR TSMAENGFSP ATRVFEAAGA GACLITDYWQ GIDLFLKPGE
EILVARDGQD VADLLTGLTW QQARAIGQRA LRRVLAEHTY SNRAETADAI FRAHAARAEA
AE