Gene Rleg2_1418 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_1418 
Symbol 
ID6980146 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp1442285 
End bp1444216 
Gene Length1932 bp 
Protein Length643 aa 
Translation table11 
GC content57% 
IMG OID643396139 
Producthypothetical protein 
Protein accessionYP_002280938 
Protein GI209549021 
COG category[S] Function unknown 
COG ID[COG5616] Predicted integral membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.070904 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTTTTG TTCTGAAGAC CTTCGGGAGG CTGCAGCTCG TTGACGAGGA GGGGAACACT 
GTCGCGTTTC CCGAGAAAGG CCTGCTGCTG CTCGCCTATT TGTTTACAAC CGATCAAGGT
TCGGCGGATC GAACGACCTT GGCTCGTTTC CTTTGGGGCG AGGCCGACCG GGACGTTGCA
CTGTCGACTT TACGGAAGGT GATTTCGAGA ATTAAGGCCC GTCAAATCGA TCTCGGGATA
AACATCCTTT CATCTCAGGG CAACGTGCTC ACTCTCGACC GCGGGGCACT GTCGTCCGAC
ATGCTGCTGG CCGGCGAGGG GGAAGAGGTT GAGCCGTTCT CCCGGCTGAG ACAGCTGGTG
ACGCTGATGA ACCAGCCTTT TCTCGAGCCG GTTCACAGCC ATAGCCGCCA GTTTCAGCAT
TGGCTCGCCG AACGTGAAAA TTACCATAAC GAGCTTCTCA CCAATACTTT GAGAACGGCC
TCGCAGCTGG CCCCGTCGCG AAAGGACGCC GAGCTCCTGA GGAAGGCCGC GGTTATCCTG
TTTCGGACCG AGCCGAAGGA TCCGGAGACG CTGCAACTGC TGATAAAGAT CTTCAAGGCG
GAAGGGGAAG TCGAATTACT TAGGAGCTAT TTCGCACAGC GGCGCAGTGC GACATCGAGA
AGCAGCGCCG CGCGCACCGG ATCCGACGAT AGTGACGCGA AATTCCTGCG ACCGCCATCC
GCGTCGTCAA CGGCGAAACT TGAGGCCTCT CTTCCCCTTG AGCCCGAAGA TATCCGCATT
GCCATTCCCC GCCTGGTGCT ACTTCCCCCC AGAAATCAAT CCGCTCGGCC ACAAGCCGGT
TTCTTAGCCG CGTCTTTGGT GGAGGATATC ACAATCGGAT TTTGCGTCTT CAACAGCCTG
GAAGTCATAG CCCCGTATTC GGCGGTTCAA ATCGGCCACC AGGTGGAGAC GCAGAAGGCT
TTCTTCGAAC GCCATTGCGT CAACTATATT CTCGACACCC GGATCAGCAA TATGGGCGAT
GAGGTGACGC TCTTTGCCCA ACTGATCTTT TTCGATCAGA ATCAGATTGT CTGGGCCGAA
AGGTTCAGCC TCGATCGTAT GGATCTCGCC AGAGACAGGA GAGTTGTTGC CCGGCAGATC
GCTCTCTCCG TGTCCAGCGA AATCGAGCGT CATGAGGCGT TGCGCGAAGA TCTGAACCCG
GTTGCCTACC ACAGATATCT TGTCGGCAGG CGGCACCTGG CGCGGTTGAC GCTTCCGAAC
CTGCGGCGCG TCCGCAAGGA AATGAAGGCC GCGCTCAGCG TCAGCCCCGA TTTCGCACCG
GCGCTGAGTT CGATGGCGCG GACCTACTCC AAGGAGTGGT TGCTGACGGC CCGGGGAGAC
ATCGATCTTT TGCAATTGGC GGAGACTTTC GCCAAGCAAG TCACTGGGAT GCGTTCGGAC
TTCGCCGATG GCTATCGCGA GCTCGGTGTC GCCAAATTGC TGCAGGGTGC CTTCGATGAA
AGTGCCGAAG CCATGGAAGT GGCGGAAAGC CTCGCGCCAC ACTACGCCGA CGTGATTGCC
GACTACGCGG ACACGCTGGT TCATTGTTCG CTCCCCGCCA TGGCCTTGCG AAAGATCGAA
CGGGCAATCG AACTGAACCC GCTCGGTCCC GACACCTATT TCTGGACGGC TGCGGGAGCA
AACTATGCCC TCGGCGAATT CGAAGCTTCG CTGGACTATA TCGGGCAGAT GGCCGATCCT
CATCTAGCGG ATAGGTTGTC TGCTGCAAAT TGGGCCATGT TGGGTCATCA GGATAAGGCG
CGGATCTTCG TAAGAAGGTT TCGTGAAGGC AATCCGGACT TCGATGTGGA CAAGTGGTTG
TCCGCAGTAC CAAGTAAAGA GCAATGGCAT AAAGATCTTT ATCGTGAAGG CCTGAAGAAA
GCTGGATTTT AA
 
Protein sequence
MAFVLKTFGR LQLVDEEGNT VAFPEKGLLL LAYLFTTDQG SADRTTLARF LWGEADRDVA 
LSTLRKVISR IKARQIDLGI NILSSQGNVL TLDRGALSSD MLLAGEGEEV EPFSRLRQLV
TLMNQPFLEP VHSHSRQFQH WLAERENYHN ELLTNTLRTA SQLAPSRKDA ELLRKAAVIL
FRTEPKDPET LQLLIKIFKA EGEVELLRSY FAQRRSATSR SSAARTGSDD SDAKFLRPPS
ASSTAKLEAS LPLEPEDIRI AIPRLVLLPP RNQSARPQAG FLAASLVEDI TIGFCVFNSL
EVIAPYSAVQ IGHQVETQKA FFERHCVNYI LDTRISNMGD EVTLFAQLIF FDQNQIVWAE
RFSLDRMDLA RDRRVVARQI ALSVSSEIER HEALREDLNP VAYHRYLVGR RHLARLTLPN
LRRVRKEMKA ALSVSPDFAP ALSSMARTYS KEWLLTARGD IDLLQLAETF AKQVTGMRSD
FADGYRELGV AKLLQGAFDE SAEAMEVAES LAPHYADVIA DYADTLVHCS LPAMALRKIE
RAIELNPLGP DTYFWTAAGA NYALGEFEAS LDYIGQMADP HLADRLSAAN WAMLGHQDKA
RIFVRRFREG NPDFDVDKWL SAVPSKEQWH KDLYREGLKK AGF