Gene Rleg2_6156 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_6156 
Symbol 
ID6983229 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011370 
Strand
Start bp91901 
End bp95005 
Gene Length3105 bp 
Protein Length1034 aa 
Translation table11 
GC content50% 
IMG OID643399172 
Producthypothetical protein 
Protein accessionYP_002283928 
Protein GI209552012 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCATACC GAAAAATAAC AAAAGCGCAA AAGTCGCGAG TTCGCTCTTC AGCTCTTACA 
CCACAAAAGC GTCGAGTACG CGTCTCTAAG GGGGCGCACG CCACTTTACT ATCTATTCGG
AATGAGGCTC CGACGGTGGA GGTACTCAGC CGGACGATCG AGTGGACCTT GGCGCGGTAT
AAGTACCATC GTTATCTGGT CTCGTTAGTA CCACGAACCC CTTCGCAGCT TCCGGATTAC
GTTTTAGCCA CAGGAGATCA TCTTGAGGAT GCGCTAAAGT GGCAATTAGC TGCAATCTCG
GCGGCGCGCG CAAAAGTCGA GAGGCATAAT CGGAACGCTT ACACCAGCAT TGACCAGGTA
ACCGACTACA GCGCCTCCCT TGAAAGTGCC AACGCGGAAA TGCGAACCTG TATGTGGAGC
TATTCGGCAG CTTCACAAGC TCTTTTTTCG TTAGCAAAGA CCAAAGGCCT AGATGCCCAG
CGCCGTTGGA TGCAGAGAAA CGTTTATGTT TCCAATCCCT CAATCTCAAA CATTATTCTC
TACAGCAAGG GGATTGCGTC AGAAGGGGAT CGGCATCCTG TAGACATCAT GGATGTTTTG
AACAGGTTCA TTTTCTCGAA AATGGTTGAC GACGAGTTGA ACCTCCTGCT GTATTACCTG
ATATTCAATC CGCCTTTGAA TTTGGACGCG GCAGCAAAGC TTTCCCCCTT ACTTTTATAT
TTTCCCTTAG TTGATCAATA CGAATTCCTG GCCAATCTCG TTACGTCCGA TCCAAATTCT
TCGGGGCCAG AAGGTTTTCC ATACTCCGGG GAATTTATCG AACTCTTGTC CGCTACAGGT
GATTGGAGAG GGAAATCGCG GGCATCGCTT GCCGAAAATA CTGAGCCGAC TTTCCTTTCA
TTGCCTATCG TAAATCGGTG GTGCGGCTCT CTTCTCGACA GCATTGGGGT ACTCGACGGA
ATTTCCTCCG CCCCTGATCG CGAACTTGAT ATCGCACTCG CAAATGAATT TTTCCATCAA
CCGCAGTCTC CTCGAGCGTA CCTCGCAAGC TCGCTCTTCG CAATGAAATC CGCGCAAACT
TTGGCAGACG TGAAGTCCGC GCTCTATCGC AGAGAAATTG CAAACGCGCA TTTCAATAGC
ATGGGGCTCA ACTCGGATCA CATCGAGCGG CGACGTATAG AGTTTACTTT TGACGCTCTC
TCAGTCGAGG TCGGAAAGTC TGAGTCTGTC TATGAAAATC GAGAGCTTTT ACGAATTGCT
TGTATCTGCG GAATAAGCGA AGGGAGGACT CTTGAGACGC TTATTCTCCT GTTTAGCTAC
ACAGGCCAGG ATCCGCTAGC TGCGGGATAT TTTCCAGCGA GTTTGTTCTC AAGCAGTATA
ACAGAAGACG AGGTGGCCGA TATCGGACAC GATGCCAGAG TTGCGATTGC GCTTTCGCGT
GTTGCTGGTA GCCTCGGAGA TGAGGGGCAA AATCTGGTCT ATATAGCAGT AGAACAGCAT
CTCAGTGAGC GCGGGGTCAC CAAGCCGAGC GAGCTAGCTG TCGAAGGGCT TATCGACATC
GCTTTTCTCA GTGAAGCGTG CACTTCCGCG TCCTTGAGGC AATCACTTGA GTTTCTGTCC
AAGGCTGAGA TGGAGGAAGA GCGGATAAAG GTGCTCTTGA ATTTGGCGCA GGCGAATTCG
GAGAATGAGG ACGAGTACAT TGACGAGGTT CACGCAATCA TCGGTCAACA GACCATCGAA
GAGCTCCTTC AGAGGTTTCA TGTAGGCAAG GTACAATGCG ACGAGCAAGC GCTTGCAACA
TGGGCGCTGA CAGAGTTGTC GCCAAAATTC AATCGCCTTA AGGATTTCAT CGACGCTGGC
TTGCCGCCTG TCGAGAAGAA TGCTGACGTT GAGTTCATCG CCCATTTGAC TTCAGGAAAG
TCTGAAACCT TCACGTTCAA AGTCCCGAAT AACGAGTCCC TTGATATTGC CCGGACCATA
CTGGCCGAGC TGAACTCAAA GTATGCGTTG GACCCTCGCT ACGGTGTCGA TTCCTATTTG
AGTCTCGGCA TGCGTCATGG TGCCGTCGAA GCTCACCTTC AGAGCCCGCT AATCGCAGAG
AATATTCTTA CTACCAAGGA GGCCCTTGGT TATCCAGAAG ACTGCTTCTG GAAGGGGTAT
TTCCTTGATA ACGGTTATGA GTACTACGGG GAGATGATTG GGCCGATTTT GGCTAGATTT
TCGGAGAAGT TCGATAATAA ACTGGAAGCG ATCAAGAATG ATCTCCTGCA GGTGCGACGG
CCCGATAAAC CAGAGGGGCT GATTGTTGCT GACTGGTCGG AAGCATCGGT CTTGTCCACG
TGCGCCAGAT TCGCTGAGGT CGTTAACTTC GAGGCATTGA TCACGGAATT CACGTCAATT
TTCTGGGCGA ACATAGAAGG TAATCTTGGC AACGCTCGCG AGTTCATTGA AAACGTGCTT
TCGAACGAAC TAAATGAGCT GATCGACGAG TTAGAAGCCG ATGTGCGTCA AGCTACCGGA
CAACAGAGGT TACCTCCGTT CTCGGATGCT CTGATGCGGG CGCGCGAGGA ACTCAGCAAC
GCGGTGAAAG ATATATCTTC GTGGCATAAT GTTGCTCGCT CCACCCATGT CGAGCCTCTT
GGATTGGTCG ATATCATTAG CGCGGCGCAG AAAATTGTCT GCCGCCTTTA TCCCGATTTT
CAGCCGCGTG TGACGTTTTC TGGAGAAACA GGGATCTCAG TAACCTATTC GCTGCAGGTG
CTAATCGAGG TCTTCAAAGC GCTGTTTACG AACGTGTACG CTCATTCTGA AGTCGAGACG
CCGTCTGTCA ATGTACACAT GACAGTCTCA GGCGAAGACG CGCTGAATGT TGAATTTGCC
AGTGACTGCA AGGATCTGAA CAAGGCCGAG CAAGCAGCCT TAGATAACAA TGAAAAAATC
AAAACTGGCG AATACGAAAA AAAACTACCG AAAGAAGGTG GGTCCGGATT GGCGAAGGTT
GCTCGTTCAA CTCTCCGGGA CGGCAAACCA AACACTATCA TTTCCGTTGA TCATGTTGCT
AGAAAGTTTC GTGTGAGTAT GACGTTCAGA ATAATCCAGA TTTGA
 
Protein sequence
MAYRKITKAQ KSRVRSSALT PQKRRVRVSK GAHATLLSIR NEAPTVEVLS RTIEWTLARY 
KYHRYLVSLV PRTPSQLPDY VLATGDHLED ALKWQLAAIS AARAKVERHN RNAYTSIDQV
TDYSASLESA NAEMRTCMWS YSAASQALFS LAKTKGLDAQ RRWMQRNVYV SNPSISNIIL
YSKGIASEGD RHPVDIMDVL NRFIFSKMVD DELNLLLYYL IFNPPLNLDA AAKLSPLLLY
FPLVDQYEFL ANLVTSDPNS SGPEGFPYSG EFIELLSATG DWRGKSRASL AENTEPTFLS
LPIVNRWCGS LLDSIGVLDG ISSAPDRELD IALANEFFHQ PQSPRAYLAS SLFAMKSAQT
LADVKSALYR REIANAHFNS MGLNSDHIER RRIEFTFDAL SVEVGKSESV YENRELLRIA
CICGISEGRT LETLILLFSY TGQDPLAAGY FPASLFSSSI TEDEVADIGH DARVAIALSR
VAGSLGDEGQ NLVYIAVEQH LSERGVTKPS ELAVEGLIDI AFLSEACTSA SLRQSLEFLS
KAEMEEERIK VLLNLAQANS ENEDEYIDEV HAIIGQQTIE ELLQRFHVGK VQCDEQALAT
WALTELSPKF NRLKDFIDAG LPPVEKNADV EFIAHLTSGK SETFTFKVPN NESLDIARTI
LAELNSKYAL DPRYGVDSYL SLGMRHGAVE AHLQSPLIAE NILTTKEALG YPEDCFWKGY
FLDNGYEYYG EMIGPILARF SEKFDNKLEA IKNDLLQVRR PDKPEGLIVA DWSEASVLST
CARFAEVVNF EALITEFTSI FWANIEGNLG NAREFIENVL SNELNELIDE LEADVRQATG
QQRLPPFSDA LMRAREELSN AVKDISSWHN VARSTHVEPL GLVDIISAAQ KIVCRLYPDF
QPRVTFSGET GISVTYSLQV LIEVFKALFT NVYAHSEVET PSVNVHMTVS GEDALNVEFA
SDCKDLNKAE QAALDNNEKI KTGEYEKKLP KEGGSGLAKV ARSTLRDGKP NTIISVDHVA
RKFRVSMTFR IIQI