Gene Rleg2_4033 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_4033 
Symbol 
ID6982804 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp4207463 
End bp4209253 
Gene Length1791 bp 
Protein Length596 aa 
Translation table11 
GC content61% 
IMG OID643398763 
Producthypothetical protein 
Protein accessionYP_002283521 
Protein GI209551604 
COG category[S] Function unknown 
COG ID[COG5616] Predicted integral membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGAGTG CAGGCATAAA TAGGATCGCC GCCTCGCAGG AGGAGGTAGA GCAGCAGCTT 
GAGCGCATTC TTTCCAGCCG CGAGTTCCGC CTGCCCGAAC GAACGAGGAA GTTCCTCGAA
TTCGTGGTCA CGGAAACGCT GGCGGGCCGT CGCGACTATC TGAAGGCCTT TACCATCGCA
CAGGCCGTTT TTGGCCGGGA CGCGAACTTT GATGCCCAGC AGGATCCCTG CGTTCGTATC
GAAGCCGGCC GGCTGAGACG GGAACTCGAA CACTATTACC TCACAGCCGG CGGCACCGAC
CGGATCATCA TCACCATCCC GAAGGGCGGC TACGCGCCGG TCTTCGATGT CATCGGCGGC
GCTGAACCCG CTGATATCCT GCCGCTCGGG CAGCCCGAAC GGCCGGGCGT GTCAGGCGCC
GATCACGGAC AGATCCATGC GGGAACGGAT GCGACCAGCC GGAACCCCGG GGGCTGGCGC
CTTTCGCCTC GATATTGGCT GCTTGCGGCA GGGGCGGTGA TTATCCTCGC CTCAGCAGCA
GCTCTGCTTC GGCAGGTCGA ATCGCCGAGT GCGGAGCGCG AGGCGGGACC GAGCGCCAAT
AACCGCCCCA CCATTATCGT CGAGCGTTTT GAAAGCGGCT CCGGTGGGAA CCTTGCCTCC
GATATTTCTC GCGGCATGAC CGACGACATC ATCGAGAAGC TGGTGCGCTT CAATGACATC
GTTGTCGTCA CCGCCATGCC GCGGAATAAA TCCGGCCAAG TTTCGGCAGA GTCGCTCTAT
GCGCTGCAAG GAAGTGTGCG GCTCGAAGGC AGCATGCTGC GCTCGACGGC AAGGCTCGTG
CGGCGGGCGG ATGCGGCTGT CATCTGGGCA AGCAATTACG ACGCCGATAT GACGGTGCAA
GGCATCCCGA AAACGCAAGC GAGCCTTGCC GGGGATATCG CGACTGCGGT CGCGCGCCCG
TTCGGCGTCA TGTTCCAAAC CGATACCGCG ACCATCGCCG GACGCACGGA CGCCTTTTCA
TGCATTCTCT CCTACTATAG CTACCGCAGC GAAATGACCG TGCAGGCTCA TGAGGTGGCG
AAATCCTGCC TGCAACGGGC CGTGGAGAAG ATGCCGGCCG ATTCCAATGT CGTGGCCCTA
CTTTCGTTGA TCCATCTCGA CGAGTTCCGC TTTTCATACC AACTTCACAC GAAATCGACG
GCCGCGACGC TTGGCCTGGC AAAGCAACTT GCCGAGCATG CGGTGCGGCT CGACCCGAAG
AATGCACGCG CTCTTCAGGC GCTGATGCTT GCCAATTTTT TCGACAATGA TCCGGCTGCG
GCCCTCAGCG CCGGCGCCGG CGCCTATGCC AGCAATCCAA ACGATACCGA AGTGGCCGGT
GAATACGGCC TGCGGCTGTC GATGTCGGGG GAATGGGACA GAGGCTGCAC GCTGATTTCA
GAAGCGGTCG GCAAGAATGC GGGGCCACGC GGATATTACG AGGTCGGAAT GGCGCTCTGC
GCCTTCATGC GGGGCGATAC ACAGGCAGCG GAACTCTGGT CGCGCATGTC GGATCTCAAC
TACAATCCGA TGCATCGCCT CGTATTGCTC TCCATTCTCG GCGCGCTTGG AAAAAAGCAG
GAGGCAAAAG AACAGCTTGA ATGGATCCGG CGCGAGTCAC CCGCGTTGAT CCCGCACATC
AGGCAGGAAG TCACAAGGCG GCTGGCGCGG ACCGAGGATC AGCGGCGGTT TCTTGCGGGA
ATAGAGGCTG CCGGTTTGTC GGTGCAAGAT GGTGAGGCGC CGAAGGATTG A
 
Protein sequence
MTSAGINRIA ASQEEVEQQL ERILSSREFR LPERTRKFLE FVVTETLAGR RDYLKAFTIA 
QAVFGRDANF DAQQDPCVRI EAGRLRRELE HYYLTAGGTD RIIITIPKGG YAPVFDVIGG
AEPADILPLG QPERPGVSGA DHGQIHAGTD ATSRNPGGWR LSPRYWLLAA GAVIILASAA
ALLRQVESPS AEREAGPSAN NRPTIIVERF ESGSGGNLAS DISRGMTDDI IEKLVRFNDI
VVVTAMPRNK SGQVSAESLY ALQGSVRLEG SMLRSTARLV RRADAAVIWA SNYDADMTVQ
GIPKTQASLA GDIATAVARP FGVMFQTDTA TIAGRTDAFS CILSYYSYRS EMTVQAHEVA
KSCLQRAVEK MPADSNVVAL LSLIHLDEFR FSYQLHTKST AATLGLAKQL AEHAVRLDPK
NARALQALML ANFFDNDPAA ALSAGAGAYA SNPNDTEVAG EYGLRLSMSG EWDRGCTLIS
EAVGKNAGPR GYYEVGMALC AFMRGDTQAA ELWSRMSDLN YNPMHRLVLL SILGALGKKQ
EAKEQLEWIR RESPALIPHI RQEVTRRLAR TEDQRRFLAG IEAAGLSVQD GEAPKD