Gene Rleg2_2023 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_2023 
Symbol 
ID6980762 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp2084441 
End bp2085631 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content64% 
IMG OID643396745 
Productlow temperature requirement A 
Protein accessionYP_002281533 
Protein GI209549616 
COG category[S] Function unknown 
COG ID[COG4292] Predicted membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.226611 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGAAAA CGAACGGAAA GAACTGGCTG CGCGCCAAGG GGAGCGCGGC CGGCAGCAAG 
GTGACCTTTC TCGAACTGTT CTTCGACCTG GTCTTCGTCT TTTCGATCTC GCAGCTTTCG
CATGCGCTTG CCGCCCATTA TACACCGCTC GGTGCTGGCG AAGCGGCGCT GATGACCTTT
GCCGTCTGGT GGGTTTGGAT TTTCACCGCC TGGGTGACCA ACTGGCTCGA TCCCGACAAG
ATGCTGGTGC GCGGCATGCT GGTGGCGCTG ATGATGCTCG GGCTGCTGCT GTCAGCCTCC
ATTCCCGAGG CCTTCGGCGA CAAGGGGCTG CTGTTTGCCG GCGCTTATGT GGCGATGCAG
GTCGGGCGCT CGCTGTTTGC CACCTATGCG ATGACACGCG TCGACCGTGC CAATACGCTG
AATTTTATTC GCATCACCAC CTGGCTCGTG GTGGCCGCCG TCTTCTGGAT CGCCGGCGGG
CTGCTCGAGC ATGAGGCCCG GCTGATCGCC TGGGTGATCG CGCTGGCGAT CGAATATGCC
GGCCCGGCCG CGGGCTTTGC CGTGCCCGGG CTCGGCCGCT CGAAGCCGAG CGACTGGGAC
GTTTCCGGCG CCCACATGGC CGAGCGCTGC GCGCTCTTCG TCATCATCTG CCTCGGCGAG
GCGATCCTGG TGTCCGGCCG CACCTTTTCG GAGCTGCCGG TTTCGGGGTT GACCGGCATC
GTCTTCGTCA CCGGCTTTAT CGGCACCGTC GCCATGTGGT GGCTTTATTT CCGTTTTGGC
CACGGGCGCG CCGCCCACCG CATCGAGCAT GAGGCGACCC CGGGGGCCTT GGCGCGGCAG
GCTTTCACCT ATGGGCACAT CCCGATCCTT GCCGGCATCA TCGTCCATGC GGTGGCCGTC
GAGTTCATGT TCTCGCATCC GCATGAGATG GGCGATCTCG GCATTGCTAC CGCGGTGCTC
GGCGGCTCCG GCCTTTTCCT GATCGGTAAT CTCTGGTTCA AGGGTGCGAC CAGCGGGCAG
CTGCCGCTGT CGCACCTTGC CGGCCTCGTG TTGCTGATCC TGCTTGCCTT CGCGGCACCC
TTCATCGAGA TCTATCTGCT GGGCATTCTG GCGACGCTGG TGCTGATCGT CGTTGCCGCC
TGGGAATACC GCTCGCTGAG CGGTACATCG CCGGCGCCGA CGCTGCATTG A
 
Protein sequence
MAKTNGKNWL RAKGSAAGSK VTFLELFFDL VFVFSISQLS HALAAHYTPL GAGEAALMTF 
AVWWVWIFTA WVTNWLDPDK MLVRGMLVAL MMLGLLLSAS IPEAFGDKGL LFAGAYVAMQ
VGRSLFATYA MTRVDRANTL NFIRITTWLV VAAVFWIAGG LLEHEARLIA WVIALAIEYA
GPAAGFAVPG LGRSKPSDWD VSGAHMAERC ALFVIICLGE AILVSGRTFS ELPVSGLTGI
VFVTGFIGTV AMWWLYFRFG HGRAAHRIEH EATPGALARQ AFTYGHIPIL AGIIVHAVAV
EFMFSHPHEM GDLGIATAVL GGSGLFLIGN LWFKGATSGQ LPLSHLAGLV LLILLAFAAP
FIEIYLLGIL ATLVLIVVAA WEYRSLSGTS PAPTLH