Gene Rleg_3646 details

Gene Information

Locus tag: Rleg_3646
Symbol:
ID: 8014495
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012850
Strand:
Start bp: 3685217
End bp: 3687634
Gene Length: 2418 bp
Protein Length: 805 aa
Translation table: 11
GC content: 66%
IMG OID: 644826209
Product: protein of unknown function DUF404
Protein accession: YP_002977428
Protein GI: 241206332
COG category: [S] Function unknown
COG ID: [COG2308] Uncharacterized conserved protein
TIGRFAM ID:
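
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon; both assumptions match the numbers in this record but are not stated by it. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_length // 3 - 1


length = gene_length_bp(3685217, 3687634)
print(length)                     # 2418
print(protein_length_aa(length))  # 805
```

Both values agree with the Gene Length and Protein Length fields of the record.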


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGTAAGA GGCCGGCAGC GGAGCGCAGG GGAGAGGCAG AGGTCCGCAT CGCCAAGGGT 
GCGGCCTTCG GCTATGCGTC GCTTCCTGGC ACCGCTGACG AGATGGTCGA CAACAAAGGC
GCAGTCCGCC CCGTCTGGCA GAATTTCCTC TCGCATCTGA GCGCAATGCC GGAAAAGGAT
CTCGCCGAGC GTTTTGCCCG CGCCGACCGC TACCTGCGCG ATGCCGGCGT CTTCTACCGC
GCCTATGGCA GCAAGGGCAC CGGCGAACGC GCCTGGCCGA TTTCGCATAT CCCGGTGCTG
ATCGACGAGC GCGAATGGAA GACGCTGTCG GCGGGGCTCG TCCAGCGCGC CGACCTGCTG
GAGGCGATCG TTGCCGATAT TTACGGCGAC AACCGGCTGG TGGAGGAAGG GGTCCTGCCG
CCGGCGCTGA TGGCCGCCAA TCCCGAATTC CAGCGCCCGC TCGCCGGTAT CCGGCCGGGC
TCCGGCCATT ATCTGCATTT CTGCGCCTTC GAGATCGGCC GCGGACCTGA CGGCAACTGG
TGGGTGCTGG CCGACAGGAC GCAGGCACCG TCTGGCGCCG GTTTCGCGCT GGAAAGCCGC
GTCGCGACGA CCCGGGCCTT CTCGGATATC TACGCCGAAA CCCCGGTCCA CCGCCTCGCC
TCCTTCTTCG GCGCCTTCCG CGACGCACTA CAGGGTATGA AACATTCGGG CGACGACCGC
ATCGCCGTGC TGACGCCCGG CCCGGCCAAC GAGACCTATT ACGAGCACGC CTACATCGCC
CGCTATCTCG GTCTCATGCT GCTCGAAGGC GAGGACCTCA CTGTCGTGAA GGGCCGCGTC
ATGGTGCGCA CCGTTGCCGG CCTGAAGCCG ATCGGGGTGC TCTGGCGCCG TCTCGATTCG
GCCTTTGCCG ACCCGCTGGA GCTGAACCAG AATTCGCATA TCGGCACGCC CGGCCTGGTG
GAAGCGCTGC GCGCCGAAAG CCTCACCATC GTCAATGCGC TCGGCACCGG CGTTCTCGAA
ACCCGGGCGC TTCTGGCCTT CATGCCGACC ATCTGCCACC GTCTGCTGGG GGAAGATCTG
CAATTGCCCT CGATCGCTAC CTGGTGGTGT GGCCAGAAGG AAGAGCGCGA GCACGTCGCA
AAGAACATCG AGAAGATGGT GATCGGCCCG GCCTATTCCC GAGCACCCTT CTTCGACGAC
AACGGCGAGT CCGTGCTCGG CTCGTCGCTG CGTGCGACCG CCAAGGATTC CATCACCGAC
TGGCTGAGTT CGGACGGCCC GAAACTGGTC GGCCAGGAGG TCGTCACGCT GTCGACGACG
CCCGCCTGGG TGGACGGCAA GCTCGTGCCG CGGCCGATGT CGCTGCGCGT CTTTGCCGCC
CGCACGGCAA ACGGCTGGCA GATCATGCCC GGCGGCTTTG CCCGTATCGG CTCCGGCGCC
GATGTCGCGG CGATCGCCAT GCAGTCGGGC GGAGCGGCCG CCGACGTCTG GATCGTCAGC
GACAAGCCGG TCGAGCGCCA CACGCTGCTG CCGGCCGAGG GCAGCTTTAC CCGCAACATG
CCGGGCAGCC TGCCAAGCCG GGCAGCCGAC AATTTGTTCT GGCTCGGCCG TTACATCGAA
CGTGCCGAAG GGGCGCTGCG CATCCTGCGC GCCTGGCATG CGCGTTACGC CGAAGCTGCC
GATCCGAGCC AGCCGCTGCT CGCCGATGTC TCCGAGTATC TCACGGCCGT CGATATCGAT
ACCGCCGAAC CCGTGCCGGA AACGCTCTTG CGCAACATCG ACAGCGCCGT TTATTCGGCG
AGCAACATCC GCGACCGTTT CTCCCCGGAC GGCTGGCTGG CACTCAACGA TCTCGCCAAA
ACCGCCCGCC GCTTTCACGT CACCGTCGCT GCCGGCGACG ACGCCAGCCA TGCGATGACG
ATCCTTCTGC GCAAGCTCGC GGGCTTCGCC GGTCTCGTGC ACGAGAACAT GTACCGCTTC
ACCGGCTGGC GCTTCCTCTC GCTCGGCCGC TATATCGAGC GCGGCCTGCA CATGACGCGC
CTGCTCGGCC ACATGTCCGG CCCGGAAGCG CCCGACGGCG CGCTCGACAT GCTGCTCGAA
ATCGGCGACA GCGTCATGAC CCATCGCCGC CGCTACAACG TCAACACGGC GCGGCTGACC
GTCACCGACC TGCTGGCGCT CGACCCTCTC AATCCCCGCT CGGTCCTTTT CCAGGTGAAC
GAGATCCACC ATGAAGTCGA GCAGTTGCCG AATGCCCTGA TCAACGGCCA GATGTCGCCC
TTCTACCGCG AAGCGATGCG GCTCCATTCC GGCCTGGCGG TGATGACGCC GGAGGGTATG
GGGGCCGAGG TCTATCAACG GCTAGAACGC GAATTGGAGC AGCTTTCCGA TCTGCTCGCC
CAAACCTATC TCGGGTGA
 
Protein sequence
MGKRPAAERR GEAEVRIAKG AAFGYASLPG TADEMVDNKG AVRPVWQNFL SHLSAMPEKD 
LAERFARADR YLRDAGVFYR AYGSKGTGER AWPISHIPVL IDEREWKTLS AGLVQRADLL
EAIVADIYGD NRLVEEGVLP PALMAANPEF QRPLAGIRPG SGHYLHFCAF EIGRGPDGNW
WVLADRTQAP SGAGFALESR VATTRAFSDI YAETPVHRLA SFFGAFRDAL QGMKHSGDDR
IAVLTPGPAN ETYYEHAYIA RYLGLMLLEG EDLTVVKGRV MVRTVAGLKP IGVLWRRLDS
AFADPLELNQ NSHIGTPGLV EALRAESLTI VNALGTGVLE TRALLAFMPT ICHRLLGEDL
QLPSIATWWC GQKEEREHVA KNIEKMVIGP AYSRAPFFDD NGESVLGSSL RATAKDSITD
WLSSDGPKLV GQEVVTLSTT PAWVDGKLVP RPMSLRVFAA RTANGWQIMP GGFARIGSGA
DVAAIAMQSG GAAADVWIVS DKPVERHTLL PAEGSFTRNM PGSLPSRAAD NLFWLGRYIE
RAEGALRILR AWHARYAEAA DPSQPLLADV SEYLTAVDID TAEPVPETLL RNIDSAVYSA
SNIRDRFSPD GWLALNDLAK TARRFHVTVA AGDDASHAMT ILLRKLAGFA GLVHENMYRF
TGWRFLSLGR YIERGLHMTR LLGHMSGPEA PDGALDMLLE IGDSVMTHRR RYNVNTARLT
VTDLLALDPL NPRSVLFQVN EIHHEVEQLP NALINGQMSP FYREAMRLHS GLAVMTPEGM
GAEVYQRLER ELEQLSDLLA QTYLG
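
The sequences above are displayed in blocks of ten characters separated by spaces, so they need to be cleaned before any analysis. The following is a minimal sketch of how the blocks could be joined and how a GC fraction such as the one reported in the GC content field could be computed. The helper names are hypothetical, and only the first and last rows of the gene sequence are used here, so the printed checks concern the start codon, the stop codon, and row lengths rather than the full-gene GC value.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First and last rows of the gene sequence, copied from the record.
first_row = "ATGGGTAAGA GGCCGGCAGC GGAGCGCAGG GGAGAGGCAG AGGTCCGCAT CGCCAAGGGT"
last_row = "CAAACCTATC TCGGGTGA"

gene_start = clean_sequence(first_row)
gene_end = clean_sequence(last_row)

print(len(gene_start))                # 60
print(gene_start.startswith("ATG"))   # True (start codon)
print(gene_end.endswith("TGA"))       # True (stop codon)
```

Passing the complete cleaned gene sequence to `gc_fraction` would give a value to compare with the GC content field.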