Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_0497 |
Symbol | |
ID | 8011692 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | + |
Start bp | 519653 |
End bp | 520963 |
Gene Length | 1311 bp |
Protein Length | 436 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644823088 |
Product | protein of unknown function DUF224 cysteine-rich region domain protein |
Protein accession | YP_002974341 |
Protein GI | 241203245 |
COG category | [C] Energy production and conversion |
COG ID | [COG0247] Fe-S oxidoreductase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.352819 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAAACCA ACTTCACCCC GACCCAGCTC CTCGATCCCG ATGTCGCCGA ATCCGAGCAG ATTCTGCGCA AATGCGTTCA CTGCGGTTTC TGCACCGCCA CCTGTCCCAC CTATGTGACG CTCGGCAACG AGCTCGACAG CCCGCGCGGC CGCATCTACC TGATCAAGGA CATGCTGGAA AACGGCCGCC CCGCCGATAC CGAAGTCGTC ACCCATATCG ACCGCTGTCT CTCCTGCCTT GCCTGCGTTA CCACCTGTCC CTCCGGCGTC GATTACATGC ATCTGGTCGA TCACGCCCGC GCCCATATCG AAAAGACCTA CAAGCGCCCG TTCATGAACA GGCTGACGCG CGCCATCCTT GCCGCTGTGC TGCCCTATCC CGGTCGCTTT CGCCTGGCGC TCAATCTCGC CCGTCTCGGC CGGCCCTTCG CCGGTCTGAT GCGGGGTGGC GCGTTGAAAC CCTTCGCCGC CATGCTGGCA CTTGCGCCGC GCCGCATCCC CGCTGCTTCG GACTTCGCAA AACCCGGCAC GCATCGGCCC GAAACGGAAC GGCGCGGCCG GGTGGCGATC CTTTCCGGCT GCGCCCAGCC GGTGCTCGAT CCCGGCATCA ACGCGGCGGC GATCCGGCTG TTGACGCGGC TCGGCGTCGA GGTCGTGGTG CCGGAAAGCG AGGTCTGCTG CGGCTCGCTG GTGCATCACA TGGGTCGCGC CGAGCAGGCG CTCGAAAGTG CGCGCGCCAA TGTCGATATC TGGACGCGCG AGATCGAGGG GCAGGGCCTC GATGCGATCA TCATCACTGC TTCGGGCTGC GGCACGACGA TCAAGGATTA CGGCCACATG CTGCGCCTCG ATCCCGCCTA TGCCGCAAAG GCGGCCAGGG TCTCGGCGCT GGCCAAGGAC GTCACCGAAT ATCTCGCAAC CCTCGACCTG CCGGCACACA TGCCGAAGGG TATCACCGTC GCCTATCATT CCGCCTGTTC CATGCAGCAC GGCCAGCGCA TCACGCTCGC GCCGAAGCAA TTATTGAAAG CGGCGGGCTT TACTGTGCGC GATCCGGCGG AAGGCCATTT TTGCTGCGGC TCCGCCGGCA TCTACAACAT CATGCAGCCG GAGATCTCGG CCGTGCTGAA GGCGCGCAAG GTCAAGAACC TCGAGGCGAC CAAGGCCGAT ATCATTGCCA CCGGCAATAT CGGCTGCATC ACCCAGATCG CTACCGGCAC CGGCATTCCA ATCCTGCATA CGGTCGAACT GCTCGATTGG GCCTACGGCG GCGCTGTGCC GGAAAAATTA ACAGGTTTGC CGTTAGGCTG A
|
Protein sequence | MQTNFTPTQL LDPDVAESEQ ILRKCVHCGF CTATCPTYVT LGNELDSPRG RIYLIKDMLE NGRPADTEVV THIDRCLSCL ACVTTCPSGV DYMHLVDHAR AHIEKTYKRP FMNRLTRAIL AAVLPYPGRF RLALNLARLG RPFAGLMRGG ALKPFAAMLA LAPRRIPAAS DFAKPGTHRP ETERRGRVAI LSGCAQPVLD PGINAAAIRL LTRLGVEVVV PESEVCCGSL VHHMGRAEQA LESARANVDI WTREIEGQGL DAIIITASGC GTTIKDYGHM LRLDPAYAAK AARVSALAKD VTEYLATLDL PAHMPKGITV AYHSACSMQH GQRITLAPKQ LLKAAGFTVR DPAEGHFCCG SAGIYNIMQP EISAVLKARK VKNLEATKAD IIATGNIGCI TQIATGTGIP ILHTVELLDW AYGGAVPEKL TGLPLG
|
| |