Gene Rleg_3022 details


Gene Information

Locus tag: Rleg_3022
Symbol:
ID: 8013937
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012850
Strand:
Start bp: 3016332
End bp: 3017432
Gene Length: 1101 bp
Protein Length: 366 aa
Translation table: 11
GC content: 64%
IMG OID: 644825590
Product: protein of unknown function DUF185
Protein accession: YP_002976818
Protein GI: 241205722
COG category: [S] Function unknown
COG ID: [COG1565] Uncharacterized conserved protein
TIGRFAM ID:
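
The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the CDS is complete and ends in a single stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the record above.
START_BP = 3016332
END_BP = 3017432
GENE_LENGTH_BP = 1101
PROTEIN_LENGTH_AA = 366


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one stop codon (assumes a complete, unspliced CDS)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both checks agree with the values listed in the record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)               # True
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)  # True
```

Under these assumptions, 3017432 - 3016332 + 1 gives 1101 bp, and 1101 / 3 - 1 gives 366 aa, matching the Gene Length and Protein Length fields.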


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.838138
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 36
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCACCG CTCTAGGCGA AAAGATCAAG GCGATTATCC AGGCCAACGG CCCGATCAGC 
GTCACCGATT ATTTCTCGCT CTGCCTCGCG GACCCCGAAC ACGGCTATTA CCGCACCCGC
GAGCCCTTCG GCCGTTCCGG TGATTTCGTC ACCGCGCCCG AGGTCAGCCA GATCTTCGGC
GAGATGATCG GCGTCTTCAT CGTCCATGCG TGGCAGCGCC ATGGCACACC GACAGACGTC
CGCCTCGTCG AGATCGGCCC CGGCCGCGGC ACCATGATAT CAGACATGCT GCGCGTCATC
TCCCGCATCG CTCCACCGCT TTTCGACGTC ATGACCGTGC ATCTTGTCGA AACCAGCGAG
CGGCTGCGCG ATGTCCAGAG CCAGACGCTC GAACCCCACG GCGAGAAGAT CACCTGGCAT
AATGGCTTCG ACGAAGTACC TCCCGGCTTC ACGCTGATTG CCGCCAACGA ACTCTTCGAC
GCCATCCCGA TCCGCCAGTT CGTTCGCATG GCGACGGGTT TTCGCGAGCG CATGGTCGGC
ATCGACGCCG ACGGCGAGCT GACCTTCGCC CCCGGCGTCG CCGGCATCGA TCCCACGCTT
CTTCCCGAAC CGGTGCAGAA CGTGCCGGTC GGCACACTCT TCGAGATCTC GCCTGCCCGC
CAGGCGGTGA TGATGGCGAT CTGCGAGCGG TTGCGCGCCT TCGGCGGCAC GGCGCTTGCG
ATCGACTACG GCCATCTCGT CACCGGCTTC GGCGATACGC TGCAGGCCGT GCGCATGCAT
GAATTCGACC CGCCGCTCGC GCATCCAGGC GAAGCCGATC TGACGAGCCA TGTCGACTTC
CAGCAACTCG CCGAAACAGC GCTTGCGGCT GGCCTCTATC TGAACGGCGC CCTGCACCAA
GGTGATTTTC TGACCGGCCT CGGCATCCTC GAGCGCGCAA CCGCTCTCGG CCGTGATCGC
GAGCCGCACA CCCAGCAGGT CATCCAGGCG GCGGTCGAAA GGCTTGCCGG CGCCGGTGAA
GGCCGGATGG GCGAACTCTT CAAGGTGATG GCGGTCTCTT ATCCCGCCAT CGATCTCATG
CCCTTTCGTC CGGTGGATTG A
 
Protein sequence
MTTALGEKIK AIIQANGPIS VTDYFSLCLA DPEHGYYRTR EPFGRSGDFV TAPEVSQIFG 
EMIGVFIVHA WQRHGTPTDV RLVEIGPGRG TMISDMLRVI SRIAPPLFDV MTVHLVETSE
RLRDVQSQTL EPHGEKITWH NGFDEVPPGF TLIAANELFD AIPIRQFVRM ATGFRERMVG
IDADGELTFA PGVAGIDPTL LPEPVQNVPV GTLFEISPAR QAVMMAICER LRAFGGTALA
IDYGHLVTGF GDTLQAVRMH EFDPPLAHPG EADLTSHVDF QQLAETALAA GLYLNGALHQ
GDFLTGLGIL ERATALGRDR EPHTQQVIQA AVERLAGAGE GRMGELFKVM AVSYPAIDLM
PFRPVD
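
The relationship between the gene sequence and the protein sequence above can be illustrated with a small translation sketch. It is a minimal, standard-library-only example and not the method used to produce this record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in which start codons are permitted); the helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

# Codon table built in TCAG order; amino-acid assignments are shared by
# the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-base display blocks."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate complete codons from position 0 until the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First two display blocks of the gene sequence and the final 21 bases.
print(translate("ATGACCACCG CTCTAGGCGA"))   # MTTALG
print(translate("CCCTTTCGTCCGGTGGATTGA"))   # PFRPVD
```

The two fragments reproduce the first six and last six residues of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the whole protein and the GC content field to be checked the same way.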