Gene Rleg_2723 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_2723 
Symbol 
ID8013671 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp2709319 
End bp2710650 
Gene Length1332 bp 
Protein Length443 aa 
Translation table11 
GC content64% 
IMG OID644825295 
Productprotein of unknown function DUF21 
Protein accessionYP_002976525 
Protein GI241205429 
COG category[R] General function prediction only 
COG ID[COG1253] Hemolysins and related proteins containing CBS domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGATT CCGCCGGCGG GCTCTCCGAC TATATCGGGA TACTAGCCGT GTTTCTTCTC 
GTCGCCGCCA ATGGCTTCTT CGTTGCCGCC GAATTCGCCC TGGTGTCAGT CAGGCGTAGC
CGCGTCGCCG AACTCGCTGC GGCAGGCCGC ATGAACGCCT CGGCACTTCA GCGCGCCGTC
GACAATCTCG ACTCCAACCT TGCAGCCACC CAGCTTGGCA TCACCATCTC GTCGCTCGCC
CTCGGCTGGG TTGGCGAACC GGCGCTTGCC CATCTGATCG AGCCGCTGCT GTCCTGGCTG
CCCGGGCAAT GGGCGACGGC GGGCGCGCAT ACTGTTGCCG TCGTCATCGC CTTCGTCATC
ATCACGGCCC TGCATATCGT GCTCGGCGAG CTCGCGCCGA AGAGCCTAGC GCTTCAACGC
AGCGAGGCCA CTTCGCTTGC CGTGGTGCGC CCGCTCGGTC TGTTCCTGGT GCTGTTTAAG
CCGGCGATCT TTGTCCTGAA CGGCATGGGC AACATGGTGC TGCGGGGCGT CGGTCTTCGC
GCCGGAACCG GGGAATCGTC GTTCCATTCG CCGCAGGAGC TCAAGCTGCT GGTCGCTGAG
AGCCAGGAGG CCGGCCTTCT CAACCAGGTG CAGCAGCAGC TCGTCGAGCG GGTGTTCAAC
ATCGGCGACA GACCGATCTC CGACATCATG ACCCCGCGTC TCGACATCGA ATGGTTCGAC
GCCGACGACA GCGAGGCCGA GATCCTGAAG ACCATCCGCG AATGCAGCCA CGAGCAATTA
CTGGTCGCCC GCGGCTCGAT CGACGAGCCG ATCGGCATGG TGTTGAAGAA GGACCTTCTC
GATCAGGTTC TCGACGGCGG CAAGGTCCGG CCGATGGAGG TGATCAAGCA GCCGTTGGTG
CTGCATGAGG GCACCTCGGT CGTCCGCGTG CTCGACAGTT TCAAGGCCTC ACCCGTTCGC
CTCGCCATCG TCATCGACGA ATATGGCAGC CTCGAAGGCA TCGTCACCCA GACCGACCTG
CTCGAAGCCA TCGCCGGCGA CCTGCCGGGA TCCAACGAAG AGCCCGACAT CGTCGTCAGG
GAAGACGGGT CGCTCTTGAT CGATGCGATG ATGCCGGCCT TCGACGCCTT CGAACGGCTG
GGCCTGCGCG ATCGTCCGGA TGCCGATTTC CATACGCTGG CGGGCTTTGC GCTGCATCAG
CTCCAGCACA TCCCCGAAGC CGGCGAAACC TTCGTCTTCG ACAACTGGCG CTTCGAAGTG
CTCGACATGG ACGGCATGCG TATCGACAAG ATGCTCGCGA CGCGCATTCC CGCGGATGGG
GCGGAGGCCT AG
 
Protein sequence
MSDSAGGLSD YIGILAVFLL VAANGFFVAA EFALVSVRRS RVAELAAAGR MNASALQRAV 
DNLDSNLAAT QLGITISSLA LGWVGEPALA HLIEPLLSWL PGQWATAGAH TVAVVIAFVI
ITALHIVLGE LAPKSLALQR SEATSLAVVR PLGLFLVLFK PAIFVLNGMG NMVLRGVGLR
AGTGESSFHS PQELKLLVAE SQEAGLLNQV QQQLVERVFN IGDRPISDIM TPRLDIEWFD
ADDSEAEILK TIRECSHEQL LVARGSIDEP IGMVLKKDLL DQVLDGGKVR PMEVIKQPLV
LHEGTSVVRV LDSFKASPVR LAIVIDEYGS LEGIVTQTDL LEAIAGDLPG SNEEPDIVVR
EDGSLLIDAM MPAFDAFERL GLRDRPDADF HTLAGFALHQ LQHIPEAGET FVFDNWRFEV
LDMDGMRIDK MLATRIPADG AEA