Gene Rleg_3141 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_3141 
Symbol 
ID8014044 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp3140029 
End bp3141267 
Gene Length1239 bp 
Protein Length412 aa 
Translation table11 
GC content60% 
IMG OID644825707 
Productprotein of unknown function DUF989 
Protein accessionYP_002976935 
Protein GI241205839 
COG category[S] Function unknown 
COG ID[COG3748] Predicted membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.182707 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTATGAAT ATGCCATAGC ATGGGAATGG CTCGCCTTCG CGGCGCGTTG GTTCCATGTC 
ATTACCGCGA TCGCCTGGAT CGGATCGTCC TTCTATTTCA TCGCGCTCGA TCTCGGACTG
GTGAAACGCC CACATCTGCC ACCCGGCGCC TATGGCGAGG AATGGCAGGT CCATGGCGGC
GGCTTCTATC ATATCCAGAA ATATCTGGTG GCGCCCGCCC AGATGCCGGA GCACCTGACC
TGGTTCAAAT ACGAGAGCTA TTTCACCTGG ATTTCCGGCT TCCTGATGCT GTGCATCGTC
TATTACGGCG GCGCCGACCT CTTCCTGATC GATCGGCATG TGCTGGATAT CAGCCCGCCC
GTCGCAATCC TGATCTCGCT AGGGTCGCTT GCCCTCGGCT GGGTCGTCTA CGATCTGCTC
TGCAAGTCGC CGCTTGGGCG GAATACCTGG GGGCTGATGG CGGTGCTCTA TGTCGTGCTC
GTCTTCATGG CCTGGGGCTA TACGCAGCTT TTCACCGGCC GCGCCGCATT CCTGCATCTC
GGCGCCTTCA CCGCGACAAT CATGTCGGCC AACGTCTTCA TGATCATCAT TCCGAACCAG
AAGATCGTCG TCGCCGACCT CATCGCCGGA CGGGTTCCCG ATCCCAAATA TGGGCAGGTC
GCCAAGCAGC GTTCGCTGCA TAACAACTAC CTGACGCTGC CCGTCATCTT CTTCATGCTG
TCGAACCATT ATCCGCTCTC CTTCGGTACG CAGTTCAACT GGGTGATCGC GGCTCTGGTC
TTTCTGATGG GCGTCACCAT CCGCCACTGG TTCAACACGA CGCATGCCAG GAAAGGCCGG
CCGACCTGGA CCTGGATCGT CACCGTCATT CTCTTCATCC TGATCATCTG GCTTTCGACC
GTGCCGAAGC TGCTGACCGG CGAAACGGAT GCGGCAGCCG TCGCGCCCGC CTTCCAGCAA
TTCGCCGGCG ATCCGCATTT CCCCGCCGTC AAGCAACTGG TCTCGACGCG CTGTTCCATG
TGCCACGCGG CCGAGCCGGT CTATGAGGGT ATCGCGCGGC CGCCCAAGGG CGTGATGCTC
GAAAACGACG CGGAAATCGC CGCCCATGCC CGCGAGATCT ATATACAGGC GGGCCGCAGC
CATGCCATGC CGCCCGGCAA CATCACCGAT ATCACGCCGG ACGAGCGCAA GCTGCTGGTC
GCCTGGTTCG AGAGCGCAGT CGAAGGCAAG CAACAATGA
 
Protein sequence
MYEYAIAWEW LAFAARWFHV ITAIAWIGSS FYFIALDLGL VKRPHLPPGA YGEEWQVHGG 
GFYHIQKYLV APAQMPEHLT WFKYESYFTW ISGFLMLCIV YYGGADLFLI DRHVLDISPP
VAILISLGSL ALGWVVYDLL CKSPLGRNTW GLMAVLYVVL VFMAWGYTQL FTGRAAFLHL
GAFTATIMSA NVFMIIIPNQ KIVVADLIAG RVPDPKYGQV AKQRSLHNNY LTLPVIFFML
SNHYPLSFGT QFNWVIAALV FLMGVTIRHW FNTTHARKGR PTWTWIVTVI LFILIIWLST
VPKLLTGETD AAAVAPAFQQ FAGDPHFPAV KQLVSTRCSM CHAAEPVYEG IARPPKGVML
ENDAEIAAHA REIYIQAGRS HAMPPGNITD ITPDERKLLV AWFESAVEGK QQ