Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_3058 |
Symbol | |
ID | 8013970 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | + |
Start bp | 3053215 |
End bp | 3056028 |
Gene Length | 2814 bp |
Protein Length | 937 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 644825626 |
Product | hypothetical protein |
Protein accession | YP_002976854 |
Protein GI | 241205758 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR02226] N-terminal double-transmembrane domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.0693886 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGCCC TTCCCTTCGC TTTCGCCTAT CCCGCCATTC TCGGCGCGCT CGTCGCCTTG CCGGTCATCT GGTGGCTGCT ACGGCTGACG CCGCCACGTC CGAAGACCGA AGTCTTCCCG CCGCTGAAGA TCCTCGCCTC GGTGCTGAAG CGCGAGGAGA CGCCGGCGCA GAGCCCCTGG TGGCTGACGC TGCTGCGCAT GCTGCTCGCC GCCACCATCA TCCTTGCAAT CGCCGACCCG GTTTTCAATC CACGCACCAG TTCGCTGGCA TCAGGTGGCC CGCTCGTCCT CTTCGTCGAC AACAGCTGGG CGGCGGCACC CGACTGGGAG CGCCGCATCC AGACCGCCGA CGCGTTGATA GACGACGCCG AATCCGCCGG CACGCCTGTG TCGATCGCCT TCACCGCCGA TCCGAGCAAC GACGCCGTGC CCGGCACCGC CGCTGCCGCC CGCGACAAGC TGCGCGCCGC CGAGCCGCGA CCACTGGTAC CGGACCGCGA ACGCGCCTTC CAGGCGCTGC GCGCCGCGCT GAACGGCATC AAGCCCGGCA CCCTCGCCTT CCTGACTGAC GGCGCCGCCG CCAAGGCCGA CGACGCCACG GTGCGCAGGC TGGCCGAACT CCAGCCCGCT GATCTGCGCC TGATCGAGGG CGAGGCCGCG CGGACGGTCG CGATCACTGC CGCCAACAAT GCGGCGGATG CCATGACCGT CAAGGTCACG CGGCTCAACA CCTCGCAGGC CGCATCGGTG GCGCTGAACG CCGTCGATAC CCAGGGCCGC TCGATCGCCA ATGGCAGGGT GGATTTCCGC CCCGGCCAGA GCGTCGCGAC GAGTGCGATC ACCGCTCCCT TCGAGATGCG CAACGATTTC GCGCGCATCA GCGTCGACAA CGGCGCGACG GCAGGCGCCG TCCACCTTCT CGACGACGCA TTCAAGCGCC GCCGCGTCGT CTTGCTGTCA GGTGAAGGAG GGGATGAGTT CCAGCCGCTG CTCTCGCCGC TCTATTATAT CCAGCGGGCG CTGCAACCCT ATGCGGATCT GATCCAACCT GGCGATTCCG ATCTTTCGGT CGCGATACCA AAACTTCTCG CCAACAATCC CTCCATCATC ATCATGGCCG ATATCGGCCG GCTGCCGGAG GAGACCTACG AGCCTCTGAC GCGCTGGATA TCGAATGGCG GCATGCTCCT GCGGTTTGCC GGCCCGCGCA TGGCGGCAGC CCCTGCGGAC GATCCGCTGA TCCCTGTCAT CTTGCGTCAG GGAGAACGGG CGCTCGGCGG CACCTTGTCC TGGAGCGAGC CGCAGTCGCT TGCCGAATTT CCGAGTTTCG GGCCTTTCGC CGGCATAGCG CGTCCCGCCG ATGTCGTCGT CAAGCGGCAG GTGCTCGCCG AACCGACGCC CGATCTTGCC GAACGCACCT GGGCGAGCCT GGCTGACGGC ACACCGCTCG TCACGATGAA GCAGATCGCA TCCGGCCAGA TCGTCCTGTT TCACGTCACC GCGGAAGCAA CCTGGTCCGA TCTGCCGATC TCAGGCACCT TCGTCGACAT GTTGCGTCAC CTCCTGCAGA TATCGCGCTC GGGCGGTGTG ACTTCGGAAG CGCGCGGCAA TGCGCGTGTC TCCGAAACCC TGCCGCCATT CCGCATGCTG ACGGCCAAGG GCACGCTCGT CTCCGAGACG GGATCGGCGC GGCCGCTCAT TCCACAGGCC GGGGTCGAAC CGACGACAAA TTTCGACAAC CCTCCCGGGC TCTACGGTTC GGAGGATGGT TTCGCCTCGT TGAACGTGCT GCCCGAGAAC GCCGAACTGA CGCCGCTCGA CACAACAGGG ACCAATGCCG TGCGCGAAGG CCTGATCGGC GGCGAAAGCT GGTCGGCAAA ACCCGCGCTC TTTCTCGCAG CCTTTCTGCT GCTCCTGGCC GATAGCCTGA TCGTCCTCTT CATGAATGGC GCATTCTCGC GATTGCGCCC GGCTGTCCGA ACCGCCGCGA TGATCGCGGT CGCCGTCGGC GCCGGCTTTC TCGTGCAACC CGGCACGCTT CACGCCGACG ACTCCCAGCC CGGCGACGAC CTTATCCTGC AGAGGCTTGA CAACACCCAT CTCGCCTATG TCGTCACCGG CGAACAGGAT GTGGATAATA TATCCGAGCG CGGTCTTGAG GGACTAACCC AGTTCCTGAC TTTTCGCACG ACGCTGGAGC CGGCGTCGCC CGTCGGCATC GACCTCACCA AAGACGAACT CTCCTTCTAT CCGATCATCT ACTGGCCGGT TTCGGCGACG GCGCCGATGC CGTCGACGGC GGCAATCAGC CGTATCGATG CCTATATGCG CAATGGCGGC ACCGTGCTTT TCGATACGCG TGATCAGATC AGCGCATTGG ACAATGGCGG CAATGTCAGC GCCAACGGTG AGAGATTGCA GGCAATCCTC ACCAATCTCG ACATCCCGCC GCTCGAGCCG GTACCATCAG ATCACGTGCT GACGAAATCC TTCTATCTTC TGTCGAGCTT TCCCGGTCGT TATACGGGCA GCCCCCTCTG GATCGAGTCC CGGCAGGGCG GTCAGGGACC GAGCGAAAAA TCGGCGGCGA CGGCCGACGG CGTTTCGCCG ATCCTGATCA CCGGCAATGA TTTCGCCGGC GCCTGGGCGA TCGACGACAA CGGCGTACCG ATACTGCCGA CCGTGCCGTC GGATGAAACG CAGCGCGAAT ATGCCTACCG TTCCGGCGTC AACATCATGA TGTACATGCT GACCGGCAAC TACAAGACCG ACCAGGTCCA TGTTCCCGAC CTCCTCGAAC GGTTAGGACA ATGA
|
Protein sequence | MNALPFAFAY PAILGALVAL PVIWWLLRLT PPRPKTEVFP PLKILASVLK REETPAQSPW WLTLLRMLLA ATIILAIADP VFNPRTSSLA SGGPLVLFVD NSWAAAPDWE RRIQTADALI DDAESAGTPV SIAFTADPSN DAVPGTAAAA RDKLRAAEPR PLVPDRERAF QALRAALNGI KPGTLAFLTD GAAAKADDAT VRRLAELQPA DLRLIEGEAA RTVAITAANN AADAMTVKVT RLNTSQAASV ALNAVDTQGR SIANGRVDFR PGQSVATSAI TAPFEMRNDF ARISVDNGAT AGAVHLLDDA FKRRRVVLLS GEGGDEFQPL LSPLYYIQRA LQPYADLIQP GDSDLSVAIP KLLANNPSII IMADIGRLPE ETYEPLTRWI SNGGMLLRFA GPRMAAAPAD DPLIPVILRQ GERALGGTLS WSEPQSLAEF PSFGPFAGIA RPADVVVKRQ VLAEPTPDLA ERTWASLADG TPLVTMKQIA SGQIVLFHVT AEATWSDLPI SGTFVDMLRH LLQISRSGGV TSEARGNARV SETLPPFRML TAKGTLVSET GSARPLIPQA GVEPTTNFDN PPGLYGSEDG FASLNVLPEN AELTPLDTTG TNAVREGLIG GESWSAKPAL FLAAFLLLLA DSLIVLFMNG AFSRLRPAVR TAAMIAVAVG AGFLVQPGTL HADDSQPGDD LILQRLDNTH LAYVVTGEQD VDNISERGLE GLTQFLTFRT TLEPASPVGI DLTKDELSFY PIIYWPVSAT APMPSTAAIS RIDAYMRNGG TVLFDTRDQI SALDNGGNVS ANGERLQAIL TNLDIPPLEP VPSDHVLTKS FYLLSSFPGR YTGSPLWIES RQGGQGPSEK SAATADGVSP ILITGNDFAG AWAIDDNGVP ILPTVPSDET QREYAYRSGV NIMMYMLTGN YKTDQVHVPD LLERLGQ
|
| |