Gene Rleg2_3344 details

Gene Information

Locus tag: Rleg2_3344
Symbol:
ID: 6982098
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011369
Strand:
Start bp: 3445647
End bp: 3448064
Gene Length: 2418 bp
Protein Length: 805 aa
Translation table: 11
GC content: 67%
IMG OID: 643398062
Product: protein of unknown function DUF404
Protein accession: YP_002282837
Protein GI: 209550920
COG category: [S] Function unknown
COG ID: [COG2307] Uncharacterized protein conserved in bacteria; [COG2308] Uncharacterized conserved protein
TIGRFAM ID:
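
The coordinates and lengths listed in this record agree with each other, and a short sketch can confirm that. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of the source record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(3445647, 3448064)
print(length_bp)                  # 2418
print(protein_length(length_bp))  # 805
```

Both values match the Gene Length and Protein Length fields of the record.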


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.550861
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGTAAGA GGCCGGCAAC AGAGCGCAGG GAAGAGACAG AGGACCGCAC CGGCAAACGT 
GCGGCCTTCG ACTATGCGCC GCTTCCCGGC ACCGCCGACG AGATGGTTGA CAACAAAGGC
AGGGTCCGCC CGGTCTGGCA GCATTTCCTC TCGCATCTGA GCGCAATGCC GGAAAAGGAT
CTTGCCGAGC GTTTTGCCCG CGCCGACCGC TACCTCAGAG ACGCCGGCGT CTTCTACCGG
GCCTATGGCA GCAAGGGCAC CGGCGAACGC GCCTGGCCGA TCTCGCATAT CCCGGTGCTG
ATCGACGAGC GCGAATGGCA GACGCTGTCG GCCGGCCTCG TCCAGCGCGC TGACCTGCTG
GAAGCGATCG TCGCCGACAT CTACGGCGAC AACCGGCTGG TGGAGGAAGG CGTCCTGCCG
CCGGCGCTGA TGGCCGCCAA TCCCGAATTC CAACGCCCGC TTGCCGGCAT CCGGCCGGCC
TCCGGCCATT ATCTGCATTT CTGCGCCTTC GAGATCGGCC GCGGGCCGGA TGGCAATTGG
TGGGTGCTGG CCGACAGGAC GCAGGCGCCG TCGGGCGCCG GTTTCGCGTT GGAAAGCCGC
GTCGCGACCA CCCGGGCGTT CTCGGATATC TATGCCGAAA CCCCGGTCCA CCGCCTCGCC
TCGTTCTTCG GCGGCTTCCG CGACGCGCTG CAGGGGATGA AACATTCCGG CGACGACCGC
ATCGCCGTGC TGACGCCGGG CCCCGCCAAC GAGACCTATT ACGAACACGC CTACATCGCC
CGCTATCTCG GCTTCATGCT GCTCGAGGGT GAGGATCTCA CCGTCGTCAA GGGCCGCGTC
ATGGTGCGCA CCGTCGCCGG CCTGAAGCCG ATCGGCGTGC TCTGGCGCCG TCTCGATTCG
GCCTTTGCCG ACCCGCTCGA ACTGAACCAG AATTCGCATA TCGGCACCCC CGGCCTCGTC
GAAGCGCTGC GCGCCGAAAG CGTCACCATC GTCAATGCGC TCGGAACCGG CGTTCTCGAG
ACCCGGGCGC TGCTGGCTTT CATGCCGACC ATCTGCCGCC GCCTGCTGGG GCAGGATCTG
CAACTGCCCT CGATCGCCAC CTGGTGGTGC GGCCAGGAAG ACGAACGCGA GCACGTCGCC
AAGCATATCG AGAAGATGGT GATCGGCCCG GCCTATTCCC GCGCCCCCTT CTTCGACGAC
GACGGCGAAT CCGTGCTCGG CTCGTCGCTG CGGGCGACCG CCAAGGATTC CATCACCGAC
TGGCTGAGTT CGGACGGCCC GAAACTGGTG GGACAGGAGG TCGTCACGCT GTCGACGACG
CCCGCCTGGG TCGACGGCAA GCTGGTGCCG CGGCCGATGT CGCTGCGCGT CTTCGCCGCC
CGCACGGCGA ACGGCTGGCA GATCATGCCC GGCGGTTTTG CCCGTATCGG CTCCGGCGCC
GATGTCGCGG CGATCGCCAT GCAGTCGGGC GGCGCGGCCG CCGACGTCTG GATCGTCAGC
GACAAGCCGG TCGAGCGCCA CACGCTGCTG CCGGCCGAGG GCAGCTTTAC CCGCAACATG
CCGGGCAGCC TGCCAAGCCG GGCGGCCGAC AATCTGTTCT GGCTCGGCCG CTACATCGAG
CGCGCCGAAG GGGCGCTGCG CATCCTGCGC GCCTGGCATG CGCGTTACGC CGAAGCCGCC
GATCCGAGCC AGCCGCTGCT CGCCGACGTC TCCGCCTATC TCTCAGCCGT CGATATCGAT
ACCGCCGAAC CCGTGCCGGA AACGCTGCTG CGCAACATCG ACAGCGCCGT TTATTCGGCG
AGCAACATCC GCGACCGTTT TTCGCCGGAT GGCTGGCTGG CGCTCAACGA TCTTGCCAAG
ACCGCCCGCC GCTTCCATGT CACCGTCGCC GCCGGCGACG ACGCCAGCCA CGCGATGACG
ATCCTTCTGC GCAAGCTTGC GGGCTTTGCC GGCCTCGTGC ACGAAAACAT GTACCGCTTC
ATGGGCTGGC GCTTCCTCTC GCTCGGCCGC TATATCGAGC GCGGCCTGCA CATGACGCGG
CTGCTCGGCC ACATGTCCGG CCCGGAAGCG CCCGACGGCG CCCTCGACAT GCTGCTCGAA
ATCGGCGACA GCGTCATGAC CCACCGCCGC CGCTACAACG TCAACACGGC GCGGCTGACC
GTCACCGACC TGCTGGCGCT CGATCCCCTC AACCCCCGCT CGGTCCTCTT CCAGGTCAAC
GAGATCCACC ACGAGGTCGA GCAGCTGCCG AACGCCCTGA TCAACGGCCA GATGTCGCCC
TTCTACCGCG AGGCGATGCG GCTCCACTCG GGCCTGGCGG TGATGACGCC GGAGGGCATG
GGGGCCGAGG TCTATCAGCG CCTCGAACGC GAATTGGAGC AGCTTTCCGA TCTGCTCGCC
CAGACCTATC TCGGGTGA
 
Protein sequence
MGKRPATERR EETEDRTGKR AAFDYAPLPG TADEMVDNKG RVRPVWQHFL SHLSAMPEKD 
LAERFARADR YLRDAGVFYR AYGSKGTGER AWPISHIPVL IDEREWQTLS AGLVQRADLL
EAIVADIYGD NRLVEEGVLP PALMAANPEF QRPLAGIRPA SGHYLHFCAF EIGRGPDGNW
WVLADRTQAP SGAGFALESR VATTRAFSDI YAETPVHRLA SFFGGFRDAL QGMKHSGDDR
IAVLTPGPAN ETYYEHAYIA RYLGFMLLEG EDLTVVKGRV MVRTVAGLKP IGVLWRRLDS
AFADPLELNQ NSHIGTPGLV EALRAESVTI VNALGTGVLE TRALLAFMPT ICRRLLGQDL
QLPSIATWWC GQEDEREHVA KHIEKMVIGP AYSRAPFFDD DGESVLGSSL RATAKDSITD
WLSSDGPKLV GQEVVTLSTT PAWVDGKLVP RPMSLRVFAA RTANGWQIMP GGFARIGSGA
DVAAIAMQSG GAAADVWIVS DKPVERHTLL PAEGSFTRNM PGSLPSRAAD NLFWLGRYIE
RAEGALRILR AWHARYAEAA DPSQPLLADV SAYLSAVDID TAEPVPETLL RNIDSAVYSA
SNIRDRFSPD GWLALNDLAK TARRFHVTVA AGDDASHAMT ILLRKLAGFA GLVHENMYRF
MGWRFLSLGR YIERGLHMTR LLGHMSGPEA PDGALDMLLE IGDSVMTHRR RYNVNTARLT
VTDLLALDPL NPRSVLFQVN EIHHEVEQLP NALINGQMSP FYREAMRLHS GLAVMTPEGM
GAEVYQRLER ELEQLSDLLA QTYLG
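
The sequences in this record are printed in blocks of 10 characters, 60 per line, so the whitespace has to be removed before any analysis. The following is a minimal sketch of that cleanup plus a GC percentage calculation. The helper names are hypothetical, and the sample input is only the first line of the gene sequence, not the full gene.

```python
def clean_sequence(text: str) -> str:
    """Collapse a block-formatted sequence into one uppercase string."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence only (sample input, not the full gene).
first_line = "ATGGGTAAGA GGCCGGCAAC AGAGCGCAGG GAAGAGACAG AGGACCGCAC CGGCAAACGT"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Running the same two functions over all gene sequence lines should give a length of 2418 and a GC percentage comparable to the GC content field. The value for the 60-base sample need not match the whole-gene figure.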