Gene Rleg_5090 details

Gene Information

Locus tag: Rleg_5090
Symbol:
ID: 8007683
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012848
Strand:
Start bp: 480015
End bp: 481196
Gene Length: 1182 bp
Protein Length: 393 aa
Translation table: 11
GC content: 62%
IMG OID: 644822005
Product: protein of unknown function DUF900 hydrolase family protein
Protein accession: YP_002973265
Protein GI: 241113430
COG category: [S] Function unknown
COG ID: [COG4782] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
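
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and not part of any database tool.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 480015
END_BP = 481196
GENE_LENGTH_BP = 1182
PROTEIN_LENGTH_AA = 393


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming inclusive 1-based coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the last codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for the values listed in this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 481196 - 480015 + 1 gives 1182 bp, and 1182 / 3 - 1 gives 393 aa, matching the listed lengths.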


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.0102585
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGGAAC AGCGTTTGAA AGAGTCTCCG ATGGGAGCCA AGGCGATGCT GGTCGCGATC 
GGCCTGGCGA TGCTCGCCGG TTGCGCCACG AGACCATCGC CTGATGTGTT GAACCCGGTC
CGTCTGCCGG TTCATCTCCC AGGTCATCTG AATGCATCGC TTCATTCCGA TGAAGCCGCT
GCGAACCACG TAAACGTGCT CGCTGCGACA AACAGGAGCC CCGACACTGC GCGCGGTGGG
TTCGGGAGCG CGTGGGCTGA CAATCTCACC TACGAGCAAT ATGCGTTTTC GGTCCCTCCC
AACCGGAAGG ACACCGCGAT CACATATCCG ACCGCGAGAC CGGATCCCGA ACGGCAATTT
GCCGTCATCG GGCGCAAGCA GCTTGCAAAG GCGGCCTTCG TGCAGGCAGC GCTGGGCTCC
GTTCAGTCCG ACGGCACTGT CGGCATCTTC GTCCATGGCT ATAACTATAG CTATCAGGAG
GCGCTGTTTC GCACCGCGCA GATCGCTGCG GACGCCAATA TTCCGGGCTC TCCGATTCTG
TTTTCGTGGC CTTCGGCCGC TGCCGTCGCC GGCTATGTCG CCGACCGCGA TGCGGCGCTG
TCCTCGCGTA GCGACCTCGA TTCGCTTATC ACCTCGCTCT CGGCTTCAGG AAAGGTGAAA
CGCGTCATCC TTTTCGGACA CAGCATGGGC GGATTCCTGG TCATGGAGAC AGTGCGTGAG
CTCAAACTGC AGCATCGCGA CGACGTCATC GGTAAACTGG CGGTGATCCT CGCCGCCCCT
GACATCGACG TCGATGTTTT CCGGTCGCAG TTGAAGGATA TCGGGCGGAT GCCGATCCCA
ATATCCCTTC TCGTTTCGAA GGACGACAGG GCGCTGGTGG CCTCGAGCTT CATAGCCGGA
GAGCGGGCGC GGGTCGGACG CCTCGATATC GACGATCCCG TCATCAGGGA GGCTGCCTTG
AAGGAAAGGC TTCGGGTCAT CGACATCACG TCGATCCAGG CGTCCGACGG GATGGGGCAC
GACCGCTACG CATCGCTCGC CAAGTTCGGC GCGCAGCTTG CCTCCTTCGA AAGTGGGAAG
CGTTCGACCG CCGGCGAGGT TGGCGCCTAT GTCTTCGATG CCGCCGGCGC CGCGGTCGCA
AGTCCATTTC GTCTGGCCGG ACGTGTCGTC GGCTCGCAAT GA
 
Protein sequence
MTEQRLKESP MGAKAMLVAI GLAMLAGCAT RPSPDVLNPV RLPVHLPGHL NASLHSDEAA 
ANHVNVLAAT NRSPDTARGG FGSAWADNLT YEQYAFSVPP NRKDTAITYP TARPDPERQF
AVIGRKQLAK AAFVQAALGS VQSDGTVGIF VHGYNYSYQE ALFRTAQIAA DANIPGSPIL
FSWPSAAAVA GYVADRDAAL SSRSDLDSLI TSLSASGKVK RVILFGHSMG GFLVMETVRE
LKLQHRDDVI GKLAVILAAP DIDVDVFRSQ LKDIGRMPIP ISLLVSKDDR ALVASSFIAG
ERARVGRLDI DDPVIREAAL KERLRVIDIT SIQASDGMGH DRYASLAKFG AQLASFESGK
RSTAGEVGAY VFDAAGAAVA SPFRLAGRVV GSQ
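
The sequences above are printed in blocks of ten characters, so they need to be joined before any analysis. The sketch below shows one way to do that and to compute a GC fraction and check for a terminal stop codon. It assumes the stop codons of translation table 11 (TAA, TAG, TGA); all function names are hypothetical helpers introduced for illustration.

```python
# Stop codons under translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the last three bases form a table 11 stop codon."""
    return len(seq) >= 3 and seq[-3:] in STOP_CODONS


if __name__ == "__main__":
    # First and last lines of the gene sequence in this record, as printed.
    first_line = "ATGACGGAAC AGCGTTTGAA AGAGTCTCCG ATGGGAGCCA AGGCGATGCT GGTCGCGATC"
    last_line = "AGTCCATTTC GTCTGGCCGG ACGTGTCGTC GGCTCGCAAT GA"
    print(len(clean_sequence(first_line)))
    print(gc_fraction(clean_sequence(first_line)))
    print(ends_with_stop(clean_sequence(last_line)))
```

Running `gc_fraction` on the full cleaned gene sequence gives a value that can be compared with the GC content field of the record.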