Gene Rleg2_4043 details

Gene Information

Locus tag: Rleg2_4043
Symbol:
ID: 6982814
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011369
Strand:
Start bp: 4217813
End bp: 4218970
Gene Length: 1158 bp
Protein Length: 385 aa
Translation table: 11
GC content: 60%
IMG OID: 643398773
Product: protein of unknown function UPF0118
Protein accession: YP_002283531
Protein GI: 209551614
COG category: [R] General function prediction only
COG ID: [COG0628] Predicted permease
TIGRFAM ID:
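
The coordinates and lengths listed above are consistent with one another. The sketch below checks that arithmetic. It assumes 1-based inclusive coordinates, which is what the listed values imply but which the record does not state, and that the CDS ends in a stop codon. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 4217813
END_BP = 4218970


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a complete CDS: one residue per codon, minus the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(gene_length(START_BP, END_BP))                  # 1158
print(protein_length(gene_length(START_BP, END_BP)))  # 385
```

Both results match the Gene Length (1158 bp) and Protein Length (385 aa) fields.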


Plasmid Coverage information

Num covering plasmid clones: 35
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGGAGA ACACAGGCGA TACGGCAGCG CAGGGAGCCG GGTCAGGGCA GATTTCAGTA 
GAGGCGAGAA TAAGCGATCT TGTCCGGTTG GGCATCATCG GGCTTTTCGC CTATTGGACG
ATCGTCTTGA TTGCTCCCTT CGCGCTGATC GTTATTTGGT CGGCCATTCT GGCGGTGGCG
CTATTCCCGA TATTCCAAGC GCTCTGCAGG CTGCTTGGAA ACAGGCCCGT CATCGCAGCC
AGTATCATCG TCGTCTTTTG CCTCGTCCTG ATCATTGCGC CCCTGGCTCT GGTCGCGGTC
AATTTTGCCG ACACCGCGCA GGCATTGATC GGCAAGTTGC GGGTGGGGGA ATTCACGCTC
CCCTCGGCGC CGGCCGCCAT TCGGGAGTGG CCCGTTGTCG GCGAGCGGCT CCATGACGCG
TGGAATCAGA TTGCAAGCGA TCTGGCCGCA ACGATTATCA AGTTTCAGGC GCCCATTCGT
GAAGTGACGG CCGTTATCGT CACAAAGCTT GCCTCGATCG GCGGCGGCGT GTTGAGCTTT
GTCGTTTCGA TCATGCTTTC GGGAATATTT CTCACGCGGT CAGCACGCCT GGCGGCGGCC
ATACAAGTGC TGGCAAACCG GATCGCCGGT GAAAAGGGTG TCGGCTTTGC CCGGCTGGCG
GGAGCCACGG TGCGCAATGT ATCGCGGGGT GTCATCGGCG TTGCCTTCCT GCAGACATTG
CTCTGCGGAT TGTGCTTTGC TTTCTTTGGC GTCCCGGCGC GTGGGGCGCT GACATTCGTG
ATCTTCATGT TTTGCCTGAT GCAGCTGGGG CCTGGGCTCG TGCTTCTTCC CGTTATCATC
TGGTCGTGGT TTTCGTGGTC GCCCGCCGCT GCTTTTGCCT TTACCGCCAT TACCGTGCCC
ATCATGCTCA TCGACAACAT ATTGAAGCCC GTGCTGATGG CGCGGGGGCT CTCGACCCCG
ATGCCGGTCA TCCTGATCGG AGTCATCGGC GGCACACTTT CCCACGGGCT GCTGGGCTTA
TTTCTGGGGC CGGTCGTGCT CAGCGTCTTC TACGAGCTGC TGAAAGCCTG GGCCTGGCCC
TCAGTCCAGA CCGCGTCGGA AAACAGCGGC CCAGCCAAGC TCGATGCTCT GCCGGAACGC
ATCGAGCACA GGCAATGA
 
Protein sequence
MAENTGDTAA QGAGSGQISV EARISDLVRL GIIGLFAYWT IVLIAPFALI VIWSAILAVA 
LFPIFQALCR LLGNRPVIAA SIIVVFCLVL IIAPLALVAV NFADTAQALI GKLRVGEFTL
PSAPAAIREW PVVGERLHDA WNQIASDLAA TIIKFQAPIR EVTAVIVTKL ASIGGGVLSF
VVSIMLSGIF LTRSARLAAA IQVLANRIAG EKGVGFARLA GATVRNVSRG VIGVAFLQTL
LCGLCFAFFG VPARGALTFV IFMFCLMQLG PGLVLLPVII WSWFSWSPAA AFAFTAITVP
IMLIDNILKP VLMARGLSTP MPVILIGVIG GTLSHGLLGL FLGPVVLSVF YELLKAWAWP
SVQTASENSG PAKLDALPER IEHRQ
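
The protein sequence follows from the gene sequence under translation table 11. The minimal sketch below translates the first and last rows of the gene sequence and compares them with the matching stretches of the protein. Table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in its permitted start codons, so a standard codon table is enough here because the gene starts with ATG. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical, and the rows are excerpts copied from the listing, not the full gene.

```python
# Codon table in NCBI order (bases T, C, A, G); amino acids as in the standard code,
# which translation table 11 shares for all 64 codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the listing and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        residues.append(residue)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a non-empty sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First and last rows of the gene sequence, copied from the listing above.
FIRST_ROW = "ATGGCGGAGA ACACAGGCGA TACGGCAGCG CAGGGAGCCG GGTCAGGGCA GATTTCAGTA"
LAST_ROW = "ATCGAGCACA GGCAATGA"

print(translate(FIRST_ROW))  # MAENTGDTAAQGAGSGQISV
print(translate(LAST_ROW))   # IEHRQ
```

Passing the full gene sequence to `translate` and `gc_percent` in the same way would allow the complete protein sequence and the GC content field to be checked against the record.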