Gene Rleg2_3866 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_3866 
Symbol 
ID6982629 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp4008719 
End bp4009792 
Gene Length1074 bp 
Protein Length357 aa 
Translation table11 
GC content51% 
IMG OID643398588 
Productprotein of unknown function DUF955 
Protein accessionYP_002283354 
Protein GI209551437 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2856] Predicted Zn peptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000210124 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.0502715 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCTGCG CAGAAAGACT GAAAGTAGCC CGGCATAGGA AGAAGCTATC GGGAAAGCAA 
CTAGCGGAAG CATCTGGTCT TACGGAGGTA ACCGTTTCGA AAGTAGAAAA CGGTCACCAG
CCTGACGAAG CGACCATAGA AAAGTTGATT AATGCGCTCG GCTACCCCCG CGCATTCTTC
TTCATGGATA GGCCAGAGAT TCTTGAACCA CGCTCTGTAT CATTTCGTAG TCTAAAGAAA
ATGAAGGCAG CGGAGCGGAA TGCCTCGCTG GCAGCAGGCT CTAACGGCAT TGCCCTTTAT
CAATGGGTTG ATGAACGTTT TAAGCTGCCG GCGCCAGACC TCATCGATCT AAGCAGAGAG
CAGGAGCGAC CGGAAGTGGC CGCACGTCTG CTACGCCAGC ATTGGGGCAT AGGGGATCGT
CCGATCGGCA ATATCCTGCG ATTATTCGAA TCGAAGGGTA TCAGAGTGCT TTCGCTCTCA
GAGAACACGC AAAACGTGGA TGCCTATTCC TTCTGGAATG CAGATCATCC TTATATTTTC
CTCAACCAGA GAAAGACTGC TGAGCGTTCC AACTTCGATG CCGCGCACGA GCTTGGACAT
TTAGTTTTAC ACTTCCATGC CCAGGCTGAA TCGGCCCCAG AAGACGATGC AGAACGGCAA
GCAAATCAAT TTGCTTCAGC CTTCTTGATG CCCGAAGCCG ATCTGAAAAA CTCGATTGGT
CAGATATATA GTTCATCGCA AATTATCAAA GCGAAGGTCC GATGGAAGGT TTCAGCCATG
GCATTGGCAA TGAGGCTGAA CCAAGCCGGG ATGCTGTCAG ATTGGAACCA TCGGTCAATC
GTCATTGACC TTGGTCAGAG GGGTTACCGA ACGGGCGAAC CTCTGGGCGT CGAACGGGAG
GCTTCCACAC TGCTAGCGAA AGTATTTGCT GCGTTGTGGT CTAGAGGGAT CACGAAAAGC
GACATAGCCA ACGATCTCAA TCTTCCCTGG GACGAGGTCG AATCATTAGT GTTTGGCTTG
ACAGGCCCAG CCCCGGCACG ACCAGCAAAA GGTAACATCA CACTTATCAA TTAG
 
Protein sequence
MFCAERLKVA RHRKKLSGKQ LAEASGLTEV TVSKVENGHQ PDEATIEKLI NALGYPRAFF 
FMDRPEILEP RSVSFRSLKK MKAAERNASL AAGSNGIALY QWVDERFKLP APDLIDLSRE
QERPEVAARL LRQHWGIGDR PIGNILRLFE SKGIRVLSLS ENTQNVDAYS FWNADHPYIF
LNQRKTAERS NFDAAHELGH LVLHFHAQAE SAPEDDAERQ ANQFASAFLM PEADLKNSIG
QIYSSSQIIK AKVRWKVSAM ALAMRLNQAG MLSDWNHRSI VIDLGQRGYR TGEPLGVERE
ASTLLAKVFA ALWSRGITKS DIANDLNLPW DEVESLVFGL TGPAPARPAK GNITLIN