Gene Information |
Locus tag | Rleg2_3018 |
Symbol | |
ID | 6981763 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | + |
Start bp | 3078114 |
End bp | 3079163 |
Gene Length | 1050 bp |
Protein Length | 349 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 643397728 |
Product | Haemin-degrading family protein |
Protein accession | YP_002282511 |
Protein GI | 209550594 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3720] Putative heme degradation protein |
TIGRFAM ID | |
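The coordinate and length fields above are mutually consistent, which a short check can confirm. This is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive, and that the protein length excludes a single terminal stop codon; the function names `gene_length` and `protein_length` are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(3078114, 3079163)
print(length)                  # 1050
print(protein_length(length))  # 349
```

Both results match the Gene Length (1050 bp) and Protein Length (349 aa) fields of this record.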
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.0282112 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGACTGAAC AGACAAGACC GGCGCCAGCC GAAATCCGGG CGTTTCGCGC CGAAAATCCG AAGATGCGCG AGCGCGATAT CGCCGCCCAG TTGAAGATTT CCGAGGCAGC CCTCGTCGCC GCCGAAACCG GCATCAGCGT GACCCGCATC GATGGCAGCG CGCTGAAGCT TCTCGAACGC GTGGCGAGCC TCGGCGAAGT GATGGCGCTG TCGCGCAACG AAAGTGCCGT GCACGAAAAG ATCGGCGTCT TCGAAAACAT CAAAAGCGGC GTACAGGCCG CAATCGTTCT CGGCGAGAAT ATCGACCTGC GCATCTTCCC GAGCCGATGG GAACATGGCT TCGCCGTATC CAAGAAGGAT GGCGACCAGC TGCGCCTCAG CCTGCAATAT TTCGACAAGG CGGGCAACGC CGTGCACAAG GTGCACCTGC GCCCGAATTC GAATGTCGAG GCCTATCACG CGCTGGTTGC CGAGTTGAAG CTGGAAGACC AGTCGCAGGA CTTCGTCGAG GCCGAGACCG CAGATACCGT CGATGAAACC GCCGACGTCA GCCGCGACGA GCTGCGCGAC AACTGGAGCA GGCTCACCGA CACGCATGAG TTCTTCGGCA TGCTGAAGCG CCTGAAGATC GGCCGCCAGG CGGCCGTGCG CAGCGTCGGC GACGACTATG CCTGGAAGCT CGACAGCAGC GCCACGGCGG AGATGATGCA TGCCTCGGTG AAATCCGGCC TGCCGATCAT GTGCTTCGTC GCCAGTGACG GTGTCGTTCA GATCCATTCC GGCCCGATCT TCAACGTCCA GACCATGGGC CCATGGATTA ATATCATGGA CCCAACCTTC CATCTGCATC TGCGGCAGGA TCACATCGCC GAGACCTGGG CGGTGCGCAA GCCGACCAAA GACGGCCACG TCACCTCGCT GGAGGCTTAC AATGCGCAAG GCGAGATGAT CATCCAGTTC TTCGGCAAGC GGAAGGAAGG GTCCGACGAA CGCACCGAGT GGCGCGAGAT CATGGAAAAC CTGCCGCGGG CAGCCAGTGT CGCCGCATAA
Protein sequence | MTEQTRPAPA EIRAFRAENP KMRERDIAAQ LKISEAALVA AETGISVTRI DGSALKLLER VASLGEVMAL SRNESAVHEK IGVFENIKSG VQAAIVLGEN IDLRIFPSRW EHGFAVSKKD GDQLRLSLQY FDKAGNAVHK VHLRPNSNVE AYHALVAELK LEDQSQDFVE AETADTVDET ADVSRDELRD NWSRLTDTHE FFGMLKRLKI GRQAAVRSVG DDYAWKLDSS ATAEMMHASV KSGLPIMCFV ASDGVVQIHS GPIFNVQTMG PWINIMDPTF HLHLRQDHIA ETWAVRKPTK DGHVTSLEAY NAQGEMIIQF FGKRKEGSDE RTEWREIMEN LPRAASVAA
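The sequences above are shown in space-separated blocks of ten characters. The following is a minimal sketch of how such a blocked gene sequence could be cleaned, checked for GC fraction, and translated. It assumes unambiguous A/C/G/T input read in frame from the first base. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon-to-amino-acid assignments used here are those of the standard code, which translation table 11 shares; table 11 differs in the set of permitted start codons, which this sketch does not model.

```python
# Codon table built from the compact TCAG-ordered amino acid string.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block-separating whitespace and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon.

    Raises KeyError on codons containing ambiguous bases such as N.
    """
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last 30 bases of the gene sequence in this record.
head = "ATGACTGAAC AGACAAGACC GGCGCCAGCC"
tail = "CTGCCGCGGG CAGCCAGTGT CGCCGCATAA"
print(translate(head))  # MTEQTRPAPA
print(translate(tail))  # LPRAASVAA
```

The two fragments translate to the first ten and last nine residues of the listed protein sequence. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field.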