Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_5020 |
Symbol | |
ID | 6978114 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011368 |
Strand | + |
Start bp | 665939 |
End bp | 667510 |
Gene Length | 1572 bp |
Protein Length | 523 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 643394165 |
Product | cytochrome c oxidase accessory protein CcoG |
Protein accession | YP_002278983 |
Protein GI | 209547065 |
COG category | [C] Energy production and conversion |
COG ID | [COG0348] Polyferredoxin |
TIGRFAM ID | [TIGR02745] cytochrome c oxidase accessory protein FixG |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATATCT ACACAGCACC AGATCCCAGC CCGGTCGAAC GGATAGACGC GGAACCCGTG AACGCCCGCG GCAAACGTGA ACCACTTTAC GCTGCCCGAA AGAAGGTATT TCCCAAGCGC GCCGAAGGGC GCTTTCGGCG TTTCAAGTGG ATCGTGATGA TGATCACACT TGGGATCTAC TACCTGGCTC CTTGGATTCG TTGGGATCGG GGACCCTACG CGCCGGATCA GGCTATCCTG GTGGACCTGG CGTCACGGCG CTTTTTCTTC TTCTTCATCG AGATCTGGCC GCAGGAATTC TATTACGTAG CGGGCCTGCT TGTCATGGCA GGCTTCGGTC TCTTTCTCGT CACATCTGCC GTCGGGCGGG CGTGGTGCGG ATATGCTTGT CCGCAAACCG TCTGGGTCGA TCTGTTCCTT GTCGTCGAGC GGGCGATCGA GGGAGACCGC AACGCCCGCA TGAAGCTTGA TGCCGGGCCG TGGACTTTCG ACAAGGCAAG AAAAAGGGTC GTCAAGCACG CAATCTGGCT GATGATCGGC GTCGCGACAG GGGGGGCTTG GATCTTCTAT TTCGCCGACG CGCCGACGCT TGTCGTCTCG CTTTTCACAG CGAAGGCACC TGTCGTGGCC TACGCGACCG TCGCTACTCT CACTGCGACC ACCTACGTCC TGGGCGGGCT GATGCGCGAA CAGGTCTGTA CGTACATGTG CCCGTGGCCG CGAATCCAGG GCGCGATGCT CGATGAAAAT TCTCTTGTCG TTACGTACAA CGATTGGCGC GGCGAGGCCC GCTCACGCCA CGCCAAGAAG ATCTTGGCTG CCGGTCAATC GGTCGGCGAT TGTGTCGACT GCAACGCCTG CGTGGCCGTC TGCCCCATGG GTATCGACAT CCGCGACGGA CAGCAGATGG AGTGTATTAC CTGCGCCTTG TGCATCGACG CCTGCGATGG CGTCATGGAC AAGGTGGGAA AGCCGCGTGG CCTCATAGCG TATGCGACGC TCAGCGAATA CGCGGCCAAC ATGGCCATCG CGACGGACGA CGGAAGAACG CCGGTCCAGC CCACGAAGGT GCGGAATGCC GACGGTTCGT TCGTGGAGGC CATCCGGCAT TTCGACTGGC GCATCATATT CCGTCCACGC GTACTGTTTT ATGCAGTCAC CTGGCTTTTG ATCGGCGTCG CCATGGTGGT CCATCTCGCA ATGCGCGAGC GACTGGAGCT CAACGTGGTT CACGATCGAA ACCCGCAATA CGTCTTGGAG AGCGACGGCT CCATCCGCAA CGGCTACACG CTGCGAATAC TGAATATGGT TCCCGCTCCT CGCAGGATCG AGCTCAGCCT TTTAGGCCTC GACGACGGCG CCAGCATGCG CATTCCTGAA CTGACCAAGC AGGACGCGCG GACCTTTATC ATCGAAGCTG CGCCGGACGT CGCGACAACA GTCAAGGTCT TCGTGACGAG CAAACAGTCG ACCGCCGCGA TCAGCGAGTT CCTTTTTGCC ATCGAGGACT CCGGACATTC GGATAGGGCG ACCTATCGCG CCGCATTCAA CACACCGGGA GATGCCAAAT GA
|
Protein sequence | MNIYTAPDPS PVERIDAEPV NARGKREPLY AARKKVFPKR AEGRFRRFKW IVMMITLGIY YLAPWIRWDR GPYAPDQAIL VDLASRRFFF FFIEIWPQEF YYVAGLLVMA GFGLFLVTSA VGRAWCGYAC PQTVWVDLFL VVERAIEGDR NARMKLDAGP WTFDKARKRV VKHAIWLMIG VATGGAWIFY FADAPTLVVS LFTAKAPVVA YATVATLTAT TYVLGGLMRE QVCTYMCPWP RIQGAMLDEN SLVVTYNDWR GEARSRHAKK ILAAGQSVGD CVDCNACVAV CPMGIDIRDG QQMECITCAL CIDACDGVMD KVGKPRGLIA YATLSEYAAN MAIATDDGRT PVQPTKVRNA DGSFVEAIRH FDWRIIFRPR VLFYAVTWLL IGVAMVVHLA MRERLELNVV HDRNPQYVLE SDGSIRNGYT LRILNMVPAP RRIELSLLGL DDGASMRIPE LTKQDARTFI IEAAPDVATT VKVFVTSKQS TAAISEFLFA IEDSGHSDRA TYRAAFNTPG DAK
|
| |