Gene Rleg_3340 details

Gene Information

Locus tag: Rleg_3340
Symbol:
ID: 8014222
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012850
Strand:
Start bp: 3343367
End bp: 3344533
Gene Length: 1167 bp
Protein Length: 388 aa
Translation table: 11
GC content: 64%
IMG OID: 644825899
Product: hypothetical protein
Protein accession: YP_002977126
Protein GI: 241206030
COG category:
COG ID:
TIGRFAM ID:
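
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below is not part of the record; it assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that adds no residue to the protein. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 3343367
end_bp = 3344533
gene_length_bp = 1167
protein_length_aa = 388

# Assumption: coordinates are 1-based and inclusive, so span = end - start + 1.
span_bp = end_bp - start_bp + 1

# A complete CDS is a whole number of codons; the last codon is assumed
# to be the stop codon and is not counted as a residue.
codons, remainder = divmod(span_bp, 3)
residues = codons - 1

print(span_bp, codons, remainder, residues)  # 1167 389 0 388
```

Under those assumptions the span equals the stated gene length, and 389 codons minus one stop codon gives the stated 388 aa.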


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.0818449
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGATCGAAT ATGCTCTCTT GTTCGGATTG GGCTTCCTGA CCGCGGCCTT TCTCGTCTTT 
CTGGTCTCGC CCGCCGTTCA CCGCCGCATC GTCTGGTATA CGGAGAACAG GCTGAAAGCG
ACGATGCCGC TGAGCCCGCA GGAGGTTCGC GCCCAGAAGG ATATGGTGCG CGCGCTCTAT
GCCGCCGAAA ACGCCCGCAC CGCCCAGGAC CTTCTGCGTG AGCGCGAGAA ATCCCTGTCA
CTCCAGCTTC GCCACGACGC CCTCGCCGTC GACGCCGGCA GGTTTGCCGC CGAGATCGGC
GAATTGCAGG CCCAGATCGG CGAGATGCAT GTCGAGGCGG CCGACCAGCG CTCGCATCTT
CGCAAGGACG AGAACTATAT CAGCCAGTTG AAGACCAATC TGCATATTGC CGAGCAGTCC
GCTGCCAATA AGGAGAGCGA ACTGACGACG ATGAGGACGC GGCTGGGCAA ACTCGGTGAA
CAGGCCGATG GGCTGAGGAT AGACCTGGCC GCGCGCGAGA CCGAGGCGGA GAGCCTGAAA
TTCCGCGTCA ACGCGCTACG CGACGAACGC GACACGCTGC GCCAGGACGT CAGCCTGCTG
CAGAAGCGCG CCAAGGATGC CGAACAGAAA CTGACGCAGC AGCAGCATAT GGTGATCCGC
CTGGAGGACA AGGCCGCGCG CGAGAGCGCT TCCGCCACCG AAAAGGAAAA CCTGGTCGCT
CGTCGCCAGC AGGAGATCGC CAAGTTGAAG GAGCAGTTGA AGGCCGCCAA CACCGAGATC
CGCAAGGTCA ACCGGGTGCT GCGCGACGCC GGCCTTGCCG GGTTGGTCGC GGAGCTGCCG
GCGGAAATGA CGGCCGAAGA CACGACCACA TCCACGTTCG ACACCGCCAT GATCACGGCT
GAGATCGGCG AGGATGTCCG CAAGCGCAGC GCCGCCCTTG CCGAGCGCCT GCAAAAGGCA
AAGGCCGTAA CCGGACGCGA CGGTGCGATC CGCGAGGAGA TCGCCTCGAT CGCCGCCAAT
ATGGTGGCGC TGACCGCACT CAGCGAGGGT CCCGCCTCAC CGATCCGCAC GCTGCTCGTC
GAAGCTGCGG AAAAGAACGC GAACGATCGT GTCAGCCTTG CCGACAGGGC AGCCGCGATC
ATCGCCGATC CCCCGCGCGC GCCATAG
 
Protein sequence
MIEYALLFGL GFLTAAFLVF LVSPAVHRRI VWYTENRLKA TMPLSPQEVR AQKDMVRALY 
AAENARTAQD LLREREKSLS LQLRHDALAV DAGRFAAEIG ELQAQIGEMH VEAADQRSHL
RKDENYISQL KTNLHIAEQS AANKESELTT MRTRLGKLGE QADGLRIDLA ARETEAESLK
FRVNALRDER DTLRQDVSLL QKRAKDAEQK LTQQQHMVIR LEDKAARESA SATEKENLVA
RRQQEIAKLK EQLKAANTEI RKVNRVLRDA GLAGLVAELP AEMTAEDTTT STFDTAMITA
EIGEDVRKRS AALAERLQKA KAVTGRDGAI REEIASIAAN MVALTALSEG PASPIRTLLV
EAAEKNANDR VSLADRAAAI IADPPRAP
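
The gene sequence begins with GTG while the protein begins with M. This is consistent with translation table 11, in which GTG is accepted as an alternative initiation codon and is read as methionine at the start of a CDS. The following is a minimal sketch of that translation, not the tool that produced this record. The function name translate_cds is hypothetical, and the code makes the simplifying assumption that the first codon of the input is always a valid initiator. Codon assignments are those of the standard code, which table 11 shares for elongation.

```python
# Standard codon-to-amino-acid assignments, listed in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading the first codon as Met (hypothetical helper)."""
    dna = "".join(dna.split()).upper()  # drop the spacing used in the listing
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":  # stop codon ends translation
            break
        # Assumption: the first codon is a valid initiator (here GTG),
        # so it is emitted as M whatever its elongation meaning.
        protein.append("M" if pos == 0 else residue)
    return "".join(protein)


# First 60 bases of the gene sequence listed above.
first_60 = "GTGATCGAAT ATGCTCTCTT GTTCGGATTG GGCTTCCTGA CCGCGGCCTT TCTCGTCTTT"
print(translate_cds(first_60))  # MIEYALLFGLGFLTAAFLVF
```

The output matches the first 20 residues of the protein sequence listed above.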