Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_5243 |
Symbol | |
ID | 6978337 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011368 |
Strand | - |
Start bp | 873957 |
End bp | 875870 |
Gene Length | 1914 bp |
Protein Length | 637 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 643394355 |
Product | cytochrome c oxidase, subunit I |
Protein accession | YP_002279173 |
Protein GI | 209547255 |
COG category | [C] Energy production and conversion |
COG ID | [COG0843] Heme/copper-type cytochrome/quinol oxidases, subunit 1 |
TIGRFAM ID | [TIGR02891] cytochrome c oxidase, subunit I |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.407043 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.600413 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACACGGC ACGGTAAAAT TCCCTCCCCT GCTGATCTGG CTGCCGGCGA TGAAAAGGCC GAAGCTTTCC TTAAGGCCAC ATGGTCGACG CCAGCAGGGA TCGTGGCGGC CCTATCGACC GTCGATCACA AGATCATCGG CCGCCGCTAC ATTGCCACAG CCTTCGTCTT TCTGATGCTT GGCGGCGTCC TGGCGATGGC CATGCGCCTC CAGCTCGCAT TCCCGGAAGC CCGCTTCATC AGCGCTGACC GGTACAACCA GATCTTTACC GTGCATGGTA GCAACATGAT GTTCCTGTTC GCCGTGCCGG TGATGGAGGC GATGGCGGTC TACCTGGTTC CCCTGATGGT CGGTACCCGC AACATCGCCT TTCCCCGGCT GAATGCCTTC TCCTACTGGA TCTTTCTCGC CGGCGGGCTG CTTCTGTGGG CGTCTCTCGC GCTCGACACT GCCCCCGATG TCGGATGGTT CGCCTACGTG CCACTGGCAG GCCCGCAATA CGGCCCGGGC AAGCGGGCAG ACATTTGGGC TCAGATGATT ACCTTTACGG AAGTCTCGGC CCTTGCCGTT GCCGTCGAAA TCGTCGTGAC CGTCTTCAAA TTGCGGGCGC CTGGCATGTC GCTCGACCGG ATCCCGATCT TTGTCTGGTC GATGCTCGTC ACAGCCTTTC TGGTCATCAT GGCGATGCCA GCGATCATGC TCGTCAGTAC GTCGCTCATC CTTGATCGGC TGGTCGGCAC GCAGTTCTTC AATCCGGCAG AGGGCGGCGA TGCGCTCCTC TGGCAGCATC TGTTCTGGTT TTTCGGCCAT CCGGAGGTCT ATATCATCTT CCTGCCTGCC GTCGGCATGG TTTCGGCGAT GATCTCGACC TTCACGCAGC GCCCCGCCTT CGGCTATCTG CCGCTGGTGC TGGCGATGAT TGCGACCGGT GTTCTCGCCT TCGGATTGTG GGTGCACCAT ATGTTCGTGG CCGGCCTGCC GCGTGTAGGC AGCAGCTTCT TCACCGCCTC GAGCATGGCG ATCGCCATTC CGGCAGGCAC CCAGATCTTT TGCTGGCTGG CAACTCTGTG GGACGGCCGT CCCATCTTCA AGACGCCAAT GCTTTTCATC ATCGGCTTCC TCATCACATT CGTCATTGGC GGTTTGACTG GCGTCATGGT GGCATCGGTG CCGTTCGACA CCCAGGTTCA CGACACTTAT TTCGTCGTGG CGCATTTTCA CTATGTGCTG ATCGGCGGTT CCGTCTTTCC GCTGATCGGG GCGATCTATT ATTGGTTTCC GAAAATGACC GGGCGAATGA TGAGCGAAAG GCTTGGGCGC TGGGCCTTCG GCCTGATCTT CACCGGTTTT CATCTCACCT TCTTCCCGAT GCATATCCTC GGCCTGCAAG GCATGCCGCG GCGCGTTTAT ACCTATCCGC CAGAATTACC CTGGACGGGC CTGAACCTCT TCGTCAGCCT CAGTGCCGTC ATCCTGGCGG GTGGTTTCCT CGTCTTCTTC ATCGACGTCC TGCGCAGTTT CCGGCATGGC CCTGCCGCGG GCTCCAATCC CTGGAATGCA TCGACACTGG AATGGGCCAC GCCCTCGCCC CCGCCTCCTT ATAACTTCCG GCACATTCCC GTGGTCGAAT CCCGGGAGCC GCTCTGGACG TCAGGCGATG CACTGGCGGT GGCGACAGGT CTGCGGCTCG ACCGACGGGA ACTGATCGTC AGCAGTCTCA CAACTGCCGA TCCGGAAGCC CGGGAATCCT CGCCGGCCAA TTCCATCTGG CCATTTCTGG CCGCCATTGC CACGAGCGTC ATGCTGATTG CCTCGATCTT CAGTCCCTGG GCGGTCATCT GGGGCGCGAT CCCGGTAGCG ATCACCTTGA CCGGTTGGTT CTGGCCGAAA CGAACGCCGG AGGACGAATC ATGA
|
Protein sequence | MTRHGKIPSP ADLAAGDEKA EAFLKATWST PAGIVAALST VDHKIIGRRY IATAFVFLML GGVLAMAMRL QLAFPEARFI SADRYNQIFT VHGSNMMFLF AVPVMEAMAV YLVPLMVGTR NIAFPRLNAF SYWIFLAGGL LLWASLALDT APDVGWFAYV PLAGPQYGPG KRADIWAQMI TFTEVSALAV AVEIVVTVFK LRAPGMSLDR IPIFVWSMLV TAFLVIMAMP AIMLVSTSLI LDRLVGTQFF NPAEGGDALL WQHLFWFFGH PEVYIIFLPA VGMVSAMIST FTQRPAFGYL PLVLAMIATG VLAFGLWVHH MFVAGLPRVG SSFFTASSMA IAIPAGTQIF CWLATLWDGR PIFKTPMLFI IGFLITFVIG GLTGVMVASV PFDTQVHDTY FVVAHFHYVL IGGSVFPLIG AIYYWFPKMT GRMMSERLGR WAFGLIFTGF HLTFFPMHIL GLQGMPRRVY TYPPELPWTG LNLFVSLSAV ILAGGFLVFF IDVLRSFRHG PAAGSNPWNA STLEWATPSP PPPYNFRHIP VVESREPLWT SGDALAVATG LRLDRRELIV SSLTTADPEA RESSPANSIW PFLAAIATSV MLIASIFSPW AVIWGAIPVA ITLTGWFWPK RTPEDES
|
| |