Gene Rleg2_2388 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_2388 
Symbol 
ID6981127 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp2449256 
End bp2450584 
Gene Length1329 bp 
Protein Length442 aa 
Translation table11 
GC content63% 
IMG OID643397101 
Productprotein of unknown function DUF21 
Protein accessionYP_002281889 
Protein GI209549972 
COG category[R] General function prediction only 
COG ID[COG1253] Hemolysins and related proteins containing CBS domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.945135 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGAAT CCGCGGGCGG CGTCTTCGAC TATGTCGGGA TACTCGCCGT GCTTCTACTG 
GTGGCCGCCA ACGGCTTCTT CGTCGCTGCG GAATTCGCGC TGGTGTCCGT CAGGCGCAGC
CGTGTCACCG AACTTGCCGC TGCAGGCCGC ATGAATGCCT CCGCCCTACA GCGCGCCGTC
GACAATCTCG ATGCCAATCT CGCCGCCACC CAGCTCGGTA TCACCATCTC GTCGCTGGCA
CTGGGCTGGG TCGGCGAGCC GGCGCTTGCC CACTTGATCG AGCCTCTGCT GTCCTGGCTG
CCCGGGCAAT GGGCGGCAAC AGGCGCGCAT ACCGTCGCCA TCGTCATTGC CTTCGTCATC
ATTACGGCAC TGCACATCGT GCTGGGCGAG CTGGCGCCGA AAAGCCTGGC GCTTCAGCGC
AGTGAGGCCA CTTCGCTTGC CGTAGTGCGT CCGCTGGGGC TCTTCCTGGT GCTGTTCAAG
CCGGCGATCT TCGTTCTGAA CGGCATGGGC AACCTTGTGC TGCGGGGCGT CGGCCTTCGC
GCCGGAACCG GGGAATCGTC GTTCCATTCG CCGCAGGAAC TCAAGCTGCT GGTCGCCGAA
AGTCAGGAAG CCGGTCTTCT CAATCAGGTG CAGCAGCAGC TCGTCGAGCG GGTGTTCAAC
ATCGGCGACA GACCGATCTC CGACATCATG ACCCCGCGTC TCGATATCGA ATGGTTCGAT
GCCGACGACA GCGAGGCCGA GATTCTGAAG ACCATCCGTG AATGCAGCCA CGAACAATTG
CTGGTCGCCC GCGGCTCGAT CGACGAACCG ATCGGCATGG TGTTGAAGAA GGACCTTCTC
GACCAGGTTC TCGACGGCGG CAAGGTCCGG CCGATGGAGG TGATCAAGCA GCCGCTGGTG
CTGCACGAGG GCACCTCGGT CGTGCGTGTG CTCGACAGTT TCAAGGCCTC GCCTGTCCGG
CTCGCCATCG TCATCGATGA ATATGGCAGC CTTGAGGGTA TCGTCACCCA GACCGACCTG
CTCGAAGCGA TCGCCGGCGA CCTGCCGGGA TCCAATGAGG AGCCCGATAT CATCGTGCGG
GAAGACGGAT CGCTCTTGAT CGATGCGATG ATGCCCGCCT TCGACGCCTT CGAGCGGCTC
GGTCTGCGCG ATCGTCCGGA TGCCGATTTC CATACGCTTG CAGGCTTCGC GCTGCACCAG
CTCCAGCACA TCCCGGAAGC CGGCGAAACC TTCGTTTTCG ATAGCTGGCG CTTCGAAGTT
CTCGATATGG ACGGCATGCG CATCGACAAA ATGCTGGCAA CGCGCATCCC CGCGGACGGG
GAAGGCTGA
 
Protein sequence
MSESAGGVFD YVGILAVLLL VAANGFFVAA EFALVSVRRS RVTELAAAGR MNASALQRAV 
DNLDANLAAT QLGITISSLA LGWVGEPALA HLIEPLLSWL PGQWAATGAH TVAIVIAFVI
ITALHIVLGE LAPKSLALQR SEATSLAVVR PLGLFLVLFK PAIFVLNGMG NLVLRGVGLR
AGTGESSFHS PQELKLLVAE SQEAGLLNQV QQQLVERVFN IGDRPISDIM TPRLDIEWFD
ADDSEAEILK TIRECSHEQL LVARGSIDEP IGMVLKKDLL DQVLDGGKVR PMEVIKQPLV
LHEGTSVVRV LDSFKASPVR LAIVIDEYGS LEGIVTQTDL LEAIAGDLPG SNEEPDIIVR
EDGSLLIDAM MPAFDAFERL GLRDRPDADF HTLAGFALHQ LQHIPEAGET FVFDSWRFEV
LDMDGMRIDK MLATRIPADG EG