Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_4033 |
Symbol | |
ID | 6982804 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | - |
Start bp | 4207463 |
End bp | 4209253 |
Gene Length | 1791 bp |
Protein Length | 596 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 643398763 |
Product | hypothetical protein |
Protein accession | YP_002283521 |
Protein GI | 209551604 |
COG category | [S] Function unknown |
COG ID | [COG5616] Predicted integral membrane protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGAGTG CAGGCATAAA TAGGATCGCC GCCTCGCAGG AGGAGGTAGA GCAGCAGCTT GAGCGCATTC TTTCCAGCCG CGAGTTCCGC CTGCCCGAAC GAACGAGGAA GTTCCTCGAA TTCGTGGTCA CGGAAACGCT GGCGGGCCGT CGCGACTATC TGAAGGCCTT TACCATCGCA CAGGCCGTTT TTGGCCGGGA CGCGAACTTT GATGCCCAGC AGGATCCCTG CGTTCGTATC GAAGCCGGCC GGCTGAGACG GGAACTCGAA CACTATTACC TCACAGCCGG CGGCACCGAC CGGATCATCA TCACCATCCC GAAGGGCGGC TACGCGCCGG TCTTCGATGT CATCGGCGGC GCTGAACCCG CTGATATCCT GCCGCTCGGG CAGCCCGAAC GGCCGGGCGT GTCAGGCGCC GATCACGGAC AGATCCATGC GGGAACGGAT GCGACCAGCC GGAACCCCGG GGGCTGGCGC CTTTCGCCTC GATATTGGCT GCTTGCGGCA GGGGCGGTGA TTATCCTCGC CTCAGCAGCA GCTCTGCTTC GGCAGGTCGA ATCGCCGAGT GCGGAGCGCG AGGCGGGACC GAGCGCCAAT AACCGCCCCA CCATTATCGT CGAGCGTTTT GAAAGCGGCT CCGGTGGGAA CCTTGCCTCC GATATTTCTC GCGGCATGAC CGACGACATC ATCGAGAAGC TGGTGCGCTT CAATGACATC GTTGTCGTCA CCGCCATGCC GCGGAATAAA TCCGGCCAAG TTTCGGCAGA GTCGCTCTAT GCGCTGCAAG GAAGTGTGCG GCTCGAAGGC AGCATGCTGC GCTCGACGGC AAGGCTCGTG CGGCGGGCGG ATGCGGCTGT CATCTGGGCA AGCAATTACG ACGCCGATAT GACGGTGCAA GGCATCCCGA AAACGCAAGC GAGCCTTGCC GGGGATATCG CGACTGCGGT CGCGCGCCCG TTCGGCGTCA TGTTCCAAAC CGATACCGCG ACCATCGCCG GACGCACGGA CGCCTTTTCA TGCATTCTCT CCTACTATAG CTACCGCAGC GAAATGACCG TGCAGGCTCA TGAGGTGGCG AAATCCTGCC TGCAACGGGC CGTGGAGAAG ATGCCGGCCG ATTCCAATGT CGTGGCCCTA CTTTCGTTGA TCCATCTCGA CGAGTTCCGC TTTTCATACC AACTTCACAC GAAATCGACG GCCGCGACGC TTGGCCTGGC AAAGCAACTT GCCGAGCATG CGGTGCGGCT CGACCCGAAG AATGCACGCG CTCTTCAGGC GCTGATGCTT GCCAATTTTT TCGACAATGA TCCGGCTGCG GCCCTCAGCG CCGGCGCCGG CGCCTATGCC AGCAATCCAA ACGATACCGA AGTGGCCGGT GAATACGGCC TGCGGCTGTC GATGTCGGGG GAATGGGACA GAGGCTGCAC GCTGATTTCA GAAGCGGTCG GCAAGAATGC GGGGCCACGC GGATATTACG AGGTCGGAAT GGCGCTCTGC GCCTTCATGC GGGGCGATAC ACAGGCAGCG GAACTCTGGT CGCGCATGTC GGATCTCAAC TACAATCCGA TGCATCGCCT CGTATTGCTC TCCATTCTCG GCGCGCTTGG AAAAAAGCAG GAGGCAAAAG AACAGCTTGA ATGGATCCGG CGCGAGTCAC CCGCGTTGAT CCCGCACATC AGGCAGGAAG TCACAAGGCG GCTGGCGCGG ACCGAGGATC AGCGGCGGTT TCTTGCGGGA ATAGAGGCTG CCGGTTTGTC GGTGCAAGAT GGTGAGGCGC CGAAGGATTG A
|
Protein sequence | MTSAGINRIA ASQEEVEQQL ERILSSREFR LPERTRKFLE FVVTETLAGR RDYLKAFTIA QAVFGRDANF DAQQDPCVRI EAGRLRRELE HYYLTAGGTD RIIITIPKGG YAPVFDVIGG AEPADILPLG QPERPGVSGA DHGQIHAGTD ATSRNPGGWR LSPRYWLLAA GAVIILASAA ALLRQVESPS AEREAGPSAN NRPTIIVERF ESGSGGNLAS DISRGMTDDI IEKLVRFNDI VVVTAMPRNK SGQVSAESLY ALQGSVRLEG SMLRSTARLV RRADAAVIWA SNYDADMTVQ GIPKTQASLA GDIATAVARP FGVMFQTDTA TIAGRTDAFS CILSYYSYRS EMTVQAHEVA KSCLQRAVEK MPADSNVVAL LSLIHLDEFR FSYQLHTKST AATLGLAKQL AEHAVRLDPK NARALQALML ANFFDNDPAA ALSAGAGAYA SNPNDTEVAG EYGLRLSMSG EWDRGCTLIS EAVGKNAGPR GYYEVGMALC AFMRGDTQAA ELWSRMSDLN YNPMHRLVLL SILGALGKKQ EAKEQLEWIR RESPALIPHI RQEVTRRLAR TEDQRRFLAG IEAAGLSVQD GEAPKD
|
| |