Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_4347 |
Symbol | |
ID | 8015122 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | + |
Start bp | 4470734 |
End bp | 4471807 |
Gene Length | 1074 bp |
Protein Length | 357 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 644826923 |
Product | oxidoreductase domain protein |
Protein accession | YP_002978126 |
Protein GI | 241207030 |
COG category | [R] General function prediction only |
COG ID | [COG0673] Predicted dehydrogenases and related proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.000110222 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGCATCA GGACGGTGGC GATCGTTGGT TGCGGTATCG GCCGCTCCCA CATCGTCGAG GGTTACCTGC CGCATTCCGA TAAGTTCAAG GTCGTGGCGA TCTGCGACCT GAACGAGCAG CGCATGGCGG CGGTCGGCGA CGAGTTCGGC ATCGAACGGC GCACCACCTC TTTTGCGGAA CTGCTGGCCG ACGACACGAT CGACATCATC GATATCTGCA CCCCTCCCGG CATCCATCTG GAACAGGTGG TCGCGGCCCT GGCTGCCGGC AAACATGTCG TCTGCGAAAA GCCGCTGACA GGCTCGCTTG CCGCCGTCGA TACCATCATG GAAGCGGAAA AGACCGCCAA AGGCGTGCTG ATGCCGATCT TCCAGTATCG TTACGGTGAC GGCATTCAGA AGGCCAAGCG GATCATCGAC GCCGGTATTG CCGGCAAGCC CTATACCGCT TCGGTCGAAA CCTTCTGGCT GCGCAAGCCG GAATATTACG CTGTGCCTTG GCGCGGTAAA TGGGCGACGG AGCTTGGCGG CGTGCTCGTC ACCCATGCCC TGCATCTGCA CGACATGCTC ATGCATCTGA TGGGTCCGGC GGCAAGGGTC TTCGGCCGCG TCGCCACCCG AGTCAACGAT ATCGAGGTCG AGGATTGTGC CTCCGCCAGC CTGCTGATGG AAAACGGCGC CTTTGTCTCG CTGTCTTGCA CGCTTGGTTC GCAGGAGCAG ATTAGCCGGC TCAGGCTGCA CTTCGAGAAC GTCACTTTCG AAAGCAGCCA CGAACCCTAT ACCCCCGGCA AGGACCCCTG GAAGATCATC GCCGCGAATG ACGACGTGCG GGAAAAGATC GAACGGGTGG TTGGCGACTG GCAGCCGGTC GCGCCGCGTT TCACCACGCA GATGGGCCAG TTCCATGCCT TTTTGAGTGG CCATGCGCCG CTGCCGGTAA CGAGCTGGGA CGCGCGCCGG GCGCTGGAAC TCGTCACCGC CATCTACCAA TCTTCCGACA GCGGCGCTGA CGTGCCGCTG CCGGTCGGTC CCGACAGTCC GAAATACGCC GATTGGCGCG CAAGAACGAA GTAA
|
Protein sequence | MSIRTVAIVG CGIGRSHIVE GYLPHSDKFK VVAICDLNEQ RMAAVGDEFG IERRTTSFAE LLADDTIDII DICTPPGIHL EQVVAALAAG KHVVCEKPLT GSLAAVDTIM EAEKTAKGVL MPIFQYRYGD GIQKAKRIID AGIAGKPYTA SVETFWLRKP EYYAVPWRGK WATELGGVLV THALHLHDML MHLMGPAARV FGRVATRVND IEVEDCASAS LLMENGAFVS LSCTLGSQEQ ISRLRLHFEN VTFESSHEPY TPGKDPWKII AANDDVREKI ERVVGDWQPV APRFTTQMGQ FHAFLSGHAP LPVTSWDARR ALELVTAIYQ SSDSGADVPL PVGPDSPKYA DWRARTK
|
| |