Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_3385 |
Symbol | |
ID | 6982139 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | + |
Start bp | 3497566 |
End bp | 3498555 |
Gene Length | 990 bp |
Protein Length | 329 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 643398103 |
Product | hypothetical protein |
Protein accession | YP_002282878 |
Protein GI | 209550961 |
COG category | [R] General function prediction only |
COG ID | [COG2319] FOG: WD40 repeat |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 0.673193 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.250549 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGACAG TTGCACCGCT TGATCTCGAC GGCCACGTTC TGGCCGTCGA ATTTTTAGGT GATGTCCCCT TCTTCGCAAA CGCCAGCGGC ACGTTTCACC GGCTGGACGG CGGCGACAGG GTTTGCGAAG CCCATCAGGG CATGCTCACC GCCATCCGCG ATCCCTATAG CGAGAGCCTG ATCTCCGGCG GCGAAGACGG CAAGGTGCTG CGCATTGCAG CCGACGGCAG CGTCTGCGAG CTTGCCACTG CGCCACGCAA GTGGATCTCG CAGGTTGCGG CCGGCCCGCA AGGCGCCATC GCTTATTCCT ACGGCAAGAG TTCGCTCGTG CGCCTTGCCG ACGGCACGAC CAAGGAATTT GCCGAGGAGC GCACGGTCGA AGGTCTTGCC TTCGCGCCGA AGGGCCTGCG CATCGCAGCC GCGCGCTATA ACGGCGTGTC GCTGCATTGG ATCGGCATGA ACGCCAAGCC CATCGACCTC GAATGGAAGG GGGCGCATAC CGGCGTCACC TTCTCGCCCG ATGGCAATTT CCTCGTCACC TCGATGCAGG AAAACGCGCT GCACGGCTGG AAACTCGACA GCAAGCCTGG CGCTGAAGCC CGCCATATGC GCATGACCGG CTACCCCTCC AAGGTGAAAT CGCTCTCCTG GTCGGTCAAG GGCAAATGGC TCGCCTCATC CGGCGCGCCT GCCGCCATCG TTTGGCCCTT CCAAGGCAAG GACGGGCCGA TGGGCAAGGC GCCGCTCGAG CTCGGCACCC GCGCCAACAT CATGGCGACC GCGGTGAAAT TCCATCCGCT CGAAGATATC CTCGCCATCG GCTTCATCGA CGGCATGATC CTCGCCGTGC GCATCGCCGA CAGCAAGGAG GCGCTGCTGC GCCGGCCCGG CAAGGGCGCG ATCACGGCGA TGAGCTGGAG CAAAAACGGC AAGCTGCTCG CCTTCGCCTC CGAAGCCGGC GATTGCGGCG TCATCGATAT TTCGGCTTGA
|
Protein sequence | MPTVAPLDLD GHVLAVEFLG DVPFFANASG TFHRLDGGDR VCEAHQGMLT AIRDPYSESL ISGGEDGKVL RIAADGSVCE LATAPRKWIS QVAAGPQGAI AYSYGKSSLV RLADGTTKEF AEERTVEGLA FAPKGLRIAA ARYNGVSLHW IGMNAKPIDL EWKGAHTGVT FSPDGNFLVT SMQENALHGW KLDSKPGAEA RHMRMTGYPS KVKSLSWSVK GKWLASSGAP AAIVWPFQGK DGPMGKAPLE LGTRANIMAT AVKFHPLEDI LAIGFIDGMI LAVRIADSKE ALLRRPGKGA ITAMSWSKNG KLLAFASEAG DCGVIDISA
|
| |