Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_2399 |
Symbol | |
ID | 8013384 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | + |
Start bp | 2399878 |
End bp | 2401698 |
Gene Length | 1821 bp |
Protein Length | 606 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644824980 |
Product | hypothetical protein |
Protein accession | YP_002976210 |
Protein GI | 241205114 |
COG category | [S] Function unknown |
COG ID | [COG5616] Predicted integral membrane protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.797611 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.00673398 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCAGACAT CGATACCAGC AGAGGGGTCT GCTGCACCCA TTTCTTCAAG GCCGGCGCCT CAAGCCGACG ATATCAGCGA GCAACTGGAA CGTGTCGTCT CCAGCCCTGA ATTTCCGGGC GTCGGTCGCG CGGCAGCGTT TCTCCGCTAT GTCGTCTCGG AAACCCTCGA GGGTCGGGGC AACCGGATCA AAGCCTATTC GATCGCGATC GAGGTGTTCG GCCGGGATCC CGGCTTCACC CAGGACGATC CGGTGGTCAG GATCGAGGCC GGACGGCTGC GGCGGTCGCT CGAGCGCTAT TATCTCGTCG CCGGGCAGCA TGACCCTGTC AGGATCGACA TCCCCAAAGG CGGATATGTT CCAACCTTCG CCTGGTCCTG TCCGGAGTCA GCCGAAATTG GAGACGAAGA CGCTGGCGAA ACATCGCCCT CTGAGGTCCG CCCCGGAGGC TGGTGGCGCG CAAGGCGGGT GTTGTTGCCG GGTGCTGCTG CGCTCGCTGC GGTGACTATT TCGCTCTATT GGATCGGGAC GCGATCTCCC GCTCCGTCGC TCGAACGCGT TGCAAGCCTG TCGCCCGATC GTCCGGCCCT CGTGGTCGCT CCCTTTGCCA ATCTCGGCGA AGGGCCGGAG GCACAGCTCT ACACGGCTGG CCTGACCGAG GAACTGATGA CCATCCTGCC ACGGTTCAAG GAGATCAAGG TCTTCGGCCG CGAAACGTCC AAGTCCCTCC CCGCGGATGT GGGCGCATCG GAGATTCGGG CCGAATTCGG CGCACGCTAC CTCCTTGCCG GCGGGGTACG CACGTCGGGC AAGCGTCTTC GTGTGACGGC GAGACTGCTC GATACCTCGG ACGGCGAGAT CCTCTGGTCG GAGAACTATG ACAATGATCT CGCATCGGGA GACCTGTTTG CGATCCAGAC GGATGTCGCC AGAAAGGTCG CGACCGCCAT CGCCCAACCC TACGGGGTCA TGGCACAGAT CGACTCCGCC GGTCCGCCGC CGGACGACCT CGGCGCGTAT GAGTGCACGT TGCGCTTCTA TGCCTATCGT TCGGAACTGA GCGCCGAAGC GCATGCGCGC GTCCGGGATT GCCTAGAAGC TGCGGTCGCG CGGTTTCCGA GCTATGCGAC CGCCTGGGCA ATGCTGTCGA TCGTCTATCT CGATGAGGAC CGGTACAAGT TCAATCCGAC ACCAGGCCAG GATTCGGCCA TACAGCGCGC ACTCGATGCC GCCAGGCGCG CAACGCAGAT CGATCCGAAC AATACGCGCG GACTTCAAGC GCTGATGACG GCGCTGTTCT TCGACAGGCA GCTGGCTGAA TCACTTCGTG TGGGCGAGCA GGCGCTCGCC ACCAATCCCA ATGACACCGA ACTGATGGGC GAGTTCGGAA CGCGGCTTGC GATCGCTGGC CAGTGGCAAC GTGGAGCAAG CCTGCTGGAC CAGGCCATCG CGCTCAATCC AGGCAGCGGC GGCTTCTACC ACGGAACGCG AGCCCTGGCC GCCTACATGC TCCGCGACAA CCACACCGCC GTGCTGGAGA TCAGGCAGGC GAACATGCAG AAGTTCCCTC TATTCCATGT TGTTGCCGCC ATCATCTACG CAGAAGCTGG CATGATGGAC GACGCCCGCC GGGAAGGACA GGTGTTCGTC GGCATGCGAC CAGACTTCCT GCCGAACATT GTAACGGAAC TCGCAATGCG CAACATGCAG CCCGAAGATC GCAATCGTCT GATCGAAGGA CTTCGCAAAG CGGGGATGAC GGTGCCCGAT CCCGACGCAA TTGCGTCGAC TGCCGCCACC TCGGACCTGC AACCTCGCTG A
|
Protein sequence | MQTSIPAEGS AAPISSRPAP QADDISEQLE RVVSSPEFPG VGRAAAFLRY VVSETLEGRG NRIKAYSIAI EVFGRDPGFT QDDPVVRIEA GRLRRSLERY YLVAGQHDPV RIDIPKGGYV PTFAWSCPES AEIGDEDAGE TSPSEVRPGG WWRARRVLLP GAAALAAVTI SLYWIGTRSP APSLERVASL SPDRPALVVA PFANLGEGPE AQLYTAGLTE ELMTILPRFK EIKVFGRETS KSLPADVGAS EIRAEFGARY LLAGGVRTSG KRLRVTARLL DTSDGEILWS ENYDNDLASG DLFAIQTDVA RKVATAIAQP YGVMAQIDSA GPPPDDLGAY ECTLRFYAYR SELSAEAHAR VRDCLEAAVA RFPSYATAWA MLSIVYLDED RYKFNPTPGQ DSAIQRALDA ARRATQIDPN NTRGLQALMT ALFFDRQLAE SLRVGEQALA TNPNDTELMG EFGTRLAIAG QWQRGASLLD QAIALNPGSG GFYHGTRALA AYMLRDNHTA VLEIRQANMQ KFPLFHVVAA IIYAEAGMMD DARREGQVFV GMRPDFLPNI VTELAMRNMQ PEDRNRLIEG LRKAGMTVPD PDAIASTAAT SDLQPR
|
| |