Gene Rleg_0100 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_0100 
Symbol 
ID8011341 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp96024 
End bp97955 
Gene Length1932 bp 
Protein Length643 aa 
Translation table11 
GC content55% 
IMG OID644822691 
Producthypothetical protein 
Protein accessionYP_002973950 
Protein GI241202854 
COG category[S] Function unknown 
COG ID[COG5616] Predicted integral membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTTTTG TTCTAAACAC CTTTGGCAGG TTGCAGCTCG TTGACGGGGA GGGGAGCCTC 
GTCGCCTTTC CCGAGAAAGG TTTGCTGCTT CTCGTCTATT TGTTGACGAC CGGTGAAGGT
TCGGCGGATC GAACGACCTT GGCGCGTTTT CTGTGGGGCG ATGCCGATAG GGACGTTGCG
CTTTCGACGT TGCGTAAGCT GATTTCGAGA GTGAAGGCCC GTCAAGCCGA ACTCGGAATA
AACATTCTCT CATCCCAGGG CAACATGGTC TCTCTCGACC GAAAGTCCTT GTCTTCCGAC
CTCCTGCTAT CCGAGACCGA CGAAGCGGTC GCGTCGTTCT CTCTGCTTAA ACATCTTGTG
AAGCTGCTGA ACCAACCCTT TCTGGGGCCG GTTCACTGCC ACAGCCGCGA GTTTCAACAG
TGGCTTGCCG AGCGCGAAAA ATGCCATATC GACCTTCTCG CAAATACATT GAAAACGGTA
TCGCGACGAG CGCAGTCGAG AGCGGAATCA GAACTCCTGC GAAAGGCCGC CATTATCCTG
TTTCGGACGG AACCGAAGGA TCCGGATACG CTGCAGTTGC TGATAGAGAT ATTCAAGGCG
GAGGAAGAGG TTGAATCGCT TCGGACCTAT TTCGAACAGC GGCGTAATTC GATTTCGCGA
GGGATCGCGG TACGCGGCGC ATCCGACGGC GCTGACACAA AACCCGTTCG CCCGGCGTTA
GTGCCGTCAA GGGAAAAACA CGTAACCGCT GCTTCCCTTG AGCCCGAGGA CGTCAGCATT
GCAATTCCTC GCCTGGTGCT GCTTCCACCC AGAAATCAAT CCATTCATCC CCAAGCCGGT
TTTTTAGCTG CGTCTTTGGT GGAGGATATC ACGATTGGAT TTTGCGCCTT CAACAGCCTG
CAGGTCATAG CCCCATATTC GGCGGTGCAA ATTGGCCACC ACATGGAGAC CCAGAAGGCC
TTCTTTGAAC GACATCACGT CAATTACATT CTCGACACTC GGATCAGCAA TGCGGGCGAT
GACGTCACCC TGTTCGCCCA ACTGATCTTT TTTGACCAGA ATCAAATTGT CTGGGCAGAG
AGGTTCAGCC TTGATCATCG GGATCTTGTC AAAGACAGGA GGACTGTCTC TCGACGGATT
GCTCTTTCCA TATCCAGCGA AATCGAGCGC CATGAGGCGT TGCGCGAGGA TTTGAACCCG
GCTGCCTACC ATCGATATCT CGTTGGCAGG CGGCATCTGG CGCGGCTGAC ACTTCCGAAT
CTACGGCGTG CGCGCAAGGA GATGAAAGCC GCGCTCAGCC TCAGCCCCGA TTTCGCACCG
GCGCTGAGTT CAATGGCGCG GACTTACTCC AAGGAATGGT TGTTGACCGC GCGGGGTGAT
ATCGATCTGT TGAAAACGGC AGAGATCTTG GCAAAGCAGG CCACCGAAAC GCGTCCAGAT
TTTGCCGATG GATATCGCGA GTTCGGCGTG GCGAAATTGT TGCAGGGTGC ATTTGACGAA
AGCGCCGAGG CAATGGAAGT GGCGGAGAGC CTTGCCCCGC ACTATGCGGA TGTTATTGCC
GACTACGCCG ACACTTTGGT TCATTGTTCG CTCCCTGCCA TCGCCTTGCG AAAGATCGAG
CGGGCAATCG AGCTGAACCC GCTCAGCCCC GACACCTATT TCTGGACCGC TGCCGGCGCA
AATTATGCCC TTGGCGAATT CGAAGCTTCG CTGGATTACA TTGGGCAGAT GGCCGATGCC
AGTTTGGCCG ACAGGCTAGC GGCCGCAAGC TGGGCCATGT TGGGCCACCA GGACAAGGCG
CGGATCTTCG TCAGGAGGTT TCGCGAAGTC AATCCGGACT TCGACGTGGA CAAATGGCTG
TCTGCGGTTC CGAGTAAGGA GCAATGGCAT AAGGATCTTT ACCGAGAAGG CCTGAAGAAA
GCTGGATTTT AA
 
Protein sequence
MAFVLNTFGR LQLVDGEGSL VAFPEKGLLL LVYLLTTGEG SADRTTLARF LWGDADRDVA 
LSTLRKLISR VKARQAELGI NILSSQGNMV SLDRKSLSSD LLLSETDEAV ASFSLLKHLV
KLLNQPFLGP VHCHSREFQQ WLAEREKCHI DLLANTLKTV SRRAQSRAES ELLRKAAIIL
FRTEPKDPDT LQLLIEIFKA EEEVESLRTY FEQRRNSISR GIAVRGASDG ADTKPVRPAL
VPSREKHVTA ASLEPEDVSI AIPRLVLLPP RNQSIHPQAG FLAASLVEDI TIGFCAFNSL
QVIAPYSAVQ IGHHMETQKA FFERHHVNYI LDTRISNAGD DVTLFAQLIF FDQNQIVWAE
RFSLDHRDLV KDRRTVSRRI ALSISSEIER HEALREDLNP AAYHRYLVGR RHLARLTLPN
LRRARKEMKA ALSLSPDFAP ALSSMARTYS KEWLLTARGD IDLLKTAEIL AKQATETRPD
FADGYREFGV AKLLQGAFDE SAEAMEVAES LAPHYADVIA DYADTLVHCS LPAIALRKIE
RAIELNPLSP DTYFWTAAGA NYALGEFEAS LDYIGQMADA SLADRLAAAS WAMLGHQDKA
RIFVRRFREV NPDFDVDKWL SAVPSKEQWH KDLYREGLKK AGF