Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_0545 |
Symbol | |
ID | 6979261 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | + |
Start bp | 564071 |
End bp | 567406 |
Gene Length | 3336 bp |
Protein Length | 1111 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 643395257 |
Product | protein of unknown function DUF1217 |
Protein accession | YP_002280068 |
Protein GI | 209548151 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.000988824 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTACGG CTTCCCTCGC TTACACGATC CTGTCGAAGG ATATGACGTC GAGCCTGAAC AAGGTGGCGT CGCAGGCGAC GGTCAAGAAG GACGCCGAAT ACTACGCCGA TCATATCAAC AAGGTCACCT CGGTCGACGA CTTCCTCGGC GATTACAAGC TCTACAGCTA TGCGATGAAA GCCTATGGTC TCGAGGATAT GACCTACGCC AAGGCCTTCA TGAAGAAGGT GCTGGAAAGC GATCTCACCG ACCCCAACAG CTACGCCAAC AAGCTCTCCG ATACGCGCTA TCGCGAATTC GCGTCCGCCT TCAATTTCAA TGCGCCCGAG AAAGACGTGC AGACGGACGC GCAGGAGGAC GATCTGATCG GCCTCTACAA GCAGTCCTTC ATCGATGCCG ACAAGGCGAC GACTGCCGAG AGCACTTATT ACAGCAACAA TATCGACAGC GTGACGACCG TCGACGATCT GGTCAACAAT ACGAGGCTTC GCACCTACGT GCTGAAGACC TTCAACATCG ATCCCACCTA TGCGTCGAAG GATTTTCTGC GCCAGGTGCT GACGAGCGAT CTGAGTGACC CCACGAGCGT CGTCAACACG CAAGGGGGCG ACAAGTACAA GGCGCTTGCC GCCCAGTTCA GCTTCAACGC CGACGGCACG GTCACCGGCA CGGCCCAGAC CGCGGCGCAG AAGGCTTCGG TCATCGAAAC CTACACGCTG AATTCCCAGT CGGTCATCAT CGACAATTCG GTCGGCTCCG ACGTTGTCTA CGTCAACAAG ACCGCCGCCG ACTATAACCA GGCCTATTAT ACCGCCAAGA TCGGCACGAT CACCAATGTC GACGATCTGG TCGCAGACAA ACGTCTGACC TCCTACATCA AGACGGCCTA CAGCATGGGC GCCGATTTCA CTGCGGCAGC ACTGCGCACG GTCCTGACCG ACCCCGGTTA TGCCCAGCTG ATGGGTTTTA CCAATGTCTA CAACGCCTTC AACTTCAAGT CCGACGGTTC GACCTCGAGC ACGGCGCGTG TGCGGACGCT CGATCAGGCA AACAAGCTGT CGTCGGCCGC ATCGCAGACC GCCAACTACT ATAAGGTGAC CTCGCAATCG AGCAGCATCA CCAATGTCGA CGCCCTGCTC GCCGACGGAA ACATGGCGCG CTATATCAAG GATGCCTATG GTCTCGGCAC CGGTTTCAGC AATGCCGATC TGAAGAACAT CCTGACCGAC TCCGCCTATG CCGCCGCGCA AGGCCATGCC GATCTCAATG CCGATTTCAA CTTCCAGGCG GATGGATCGA TCAACGGGTC GGTGATCCAG ACGGACACGC AGCGCAAGTC GACCACCGAC AAATCGGCGG CGAACGCGGC CCACTTCAAC GCCATGATCG ACAGTGTCAC CAGTGTCGAC GACATCATGT CCGATCCCGT TGCGGTCAGC TATCTCAGAA CCAGCATGCA GGTCGCCGAC AGCGTCTCCG ATGCGACGCT GAGGACCTTC CTCGTCGATC CCGCCGCCGC CAGCGCCCAG GGTTATAGCG ACGTCCACGA TCTCTTCAAT TTCAAGACCG ACGGTTCTGT CGCCACCCTC TACGCCTCGC AGACGGCCGC GCAAAGTGCC AGCACATCAA GCAAGGCCGA TAATGCGGCC GTCTATTACC AGGCGACCAT CGCCGGCATC GCCAATGTCG ATCAGTTGCT ATCGGACCAG AAGCTCAACA ATTTCGTGCG CAACGCCTTC GGCATCCCGT CCACCGTGAC CGACGTCGCC CTTCGCGGCA TCCTGACCGA TCAGAGCGGC ACCGGCACCT ATGCCGACGT CGCCGCCGCC TTCAATTTCA AGGCCGACGG CTCTCTCAAG GACGGCCTGG CGGCGCAGAC GGACACCCAG ATCCGCAACA CGAAATTTGC CGCCGGAGCG CGCACCGACG ATTATTCCGC CCGGATGGCG ACGATTGCCA ATGTCGACGA TCTCATCGCC GATCCCGCCA TCACCAATTT CCTGAAGAGC ACCTATAATC TGCCGTTGAA CATTTCGAAC GCCGACCTGA AGAGCTACCT GACCGATGCG ACGGCTGCCT CGGCCGCCGG TTATGCCGAT CTCAACGCCG ATTTCAACTT CGCCGCCGAT GGATCGCTGC CGGTCGTCAG CTCCGTCCAG ACGGCGGATC AGGCCCAGAC CACCAATGAC AATTATATGG CACGCTATGA CGACGAGCGC GAAGAGGCGA TCGACGAGGT TGCCTCCAAC TACTCGAGCA TGATGGCCGA CAGCACCAGT CTGCTCGACA CGGCCGAGAT CAAATCCGTC AACGATTTCA TGCGCACCAA TGCCACGGCC GATTTCAAGA AGAGCAACGA CAATCTGCCG GACCCCTATC ATGTGGCGTT GCAGGCCTTC GGTCTGACCG AGCAGGACGT GCCGCGCTCG ATGATGCGCA AGATCCTGAC GAGCGATGCC TATGATCCGA ACGGCTATAT CGCCTCGCTA AAGGACGAAC GCATCACCAA TATGGCCCGC GCCTTCAATT TCGGCCCCGA CGGCAAGGCG GCCTCACCCT TCCAGGCGCT GCCCGACGCG ACTCTGGCCA AATACGCCAC CGACTACAAG GCGCATATGA CCATGCTGCT GAAGGCCGGC CCGGTAAAGG ATAAGGCATC GAAGGACGCG ACGACCGAGG TGGATTATTT CGCCAAGGGC ATGGCCAAGG TGAAGTCGCT CGACGATTTC CTCGACGACA GCCGCCTGAC GGATCTGGTG CTGAAAGCCA ACAATCTCGA CCCGAAGGAT TACGACAAGG CGACGCTGAA GAAGATCTTC ACTTCCGATC CCGACGACAA GAAGAGCTAC CTGAACGCCA AGGCCGATGC GCGCTTCAAG GATATCGTCG CCGCCTTCAA CTTCGACAAG GACGGCAATC TCACCCGCGC CAAGATGGGC ACCATCCAGA ACAAGGCGGC CGAAGAGCAC ACTCAGGAAC TCTACGTCCA GCAGACGATG GAAGCCCAGC AGGGTGAAAG CAACGATGGT GTGCGCCTGG CGCTCTATTT CAGCCGCAAG GCCCCGAGCA TCACCTCGAT CTATTCGATC CTCGGCGACA AGGCGCTCTA TCAGGTCGTC ACCACAGCCT ACAGCCTGCC CTCGCAGATG TCGGGCATGG ACGTCACCAA ACAGGCCGAC CTCCTCAACC GCTTCGTCAA GCTCGAGGAT CTCCAGGATC CGAAGAAGGT CGACAAGCTG GTGCGCCGCT TCACCGCCAT GTACGACGTC CAGAACAGCA CCCAGCAATC GCCGGCTCTG CAGATCCTGA CCGGCGGCGG CACGCAGTCG TCTTAA
|
Protein sequence | MITASLAYTI LSKDMTSSLN KVASQATVKK DAEYYADHIN KVTSVDDFLG DYKLYSYAMK AYGLEDMTYA KAFMKKVLES DLTDPNSYAN KLSDTRYREF ASAFNFNAPE KDVQTDAQED DLIGLYKQSF IDADKATTAE STYYSNNIDS VTTVDDLVNN TRLRTYVLKT FNIDPTYASK DFLRQVLTSD LSDPTSVVNT QGGDKYKALA AQFSFNADGT VTGTAQTAAQ KASVIETYTL NSQSVIIDNS VGSDVVYVNK TAADYNQAYY TAKIGTITNV DDLVADKRLT SYIKTAYSMG ADFTAAALRT VLTDPGYAQL MGFTNVYNAF NFKSDGSTSS TARVRTLDQA NKLSSAASQT ANYYKVTSQS SSITNVDALL ADGNMARYIK DAYGLGTGFS NADLKNILTD SAYAAAQGHA DLNADFNFQA DGSINGSVIQ TDTQRKSTTD KSAANAAHFN AMIDSVTSVD DIMSDPVAVS YLRTSMQVAD SVSDATLRTF LVDPAAASAQ GYSDVHDLFN FKTDGSVATL YASQTAAQSA STSSKADNAA VYYQATIAGI ANVDQLLSDQ KLNNFVRNAF GIPSTVTDVA LRGILTDQSG TGTYADVAAA FNFKADGSLK DGLAAQTDTQ IRNTKFAAGA RTDDYSARMA TIANVDDLIA DPAITNFLKS TYNLPLNISN ADLKSYLTDA TAASAAGYAD LNADFNFAAD GSLPVVSSVQ TADQAQTTND NYMARYDDER EEAIDEVASN YSSMMADSTS LLDTAEIKSV NDFMRTNATA DFKKSNDNLP DPYHVALQAF GLTEQDVPRS MMRKILTSDA YDPNGYIASL KDERITNMAR AFNFGPDGKA ASPFQALPDA TLAKYATDYK AHMTMLLKAG PVKDKASKDA TTEVDYFAKG MAKVKSLDDF LDDSRLTDLV LKANNLDPKD YDKATLKKIF TSDPDDKKSY LNAKADARFK DIVAAFNFDK DGNLTRAKMG TIQNKAAEEH TQELYVQQTM EAQQGESNDG VRLALYFSRK APSITSIYSI LGDKALYQVV TTAYSLPSQM SGMDVTKQAD LLNRFVKLED LQDPKKVDKL VRRFTAMYDV QNSTQQSPAL QILTGGGTQS S
|
| |