Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_3821 |
Symbol | |
ID | 3905569 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 4580534 |
End bp | 4582363 |
Gene Length | 1830 bp |
Protein Length | 609 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637881147 |
Product | cytochrome-c oxidase |
Protein accession | YP_482900 |
Protein GI | 86742500 |
COG category | [C] Energy production and conversion |
COG ID | [COG0843] Heme/copper-type cytochrome/quinol oxidases, subunit 1 |
TIGRFAM ID | [TIGR02891] cytochrome c oxidase, subunit I |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00155592 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0385819 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACGATTC TACACCAACC GAGAGAACCG GGCGGAGCGG CGGATACCGG GACGCAGCCA GCGGATATCG TCGCCGCGCA CACCAGGCCA CGTACCCCTT TGCTCGGCTA TCTGCGGACG ACGTCCCACA AGGACATCGC CATCCTGTAC GCCGTCACGT CGTTCGGGTT CTTCCTCTTC GCCGGTGTCC TGGCCATCAT GATGCGGGCG GAACTCGCCC GCCCGGGGCT GCAGTACTTC TCCAACGAGC AGTACAACCA GTTTTTCACG ATGCACGGCA CGCTGATGCT GCTCATGTTC GCGACCCCGC TGGCGTTCGC GTTCGCGAAC TTCCTCGTGC CGCTGCAGAT CGGCGCGCCG GACGTGGCGT TCCCGCGGCT GAACGCGCTG TCCTACTGGT TCTTCCTGTT CGGCAGCCTG ACGGTGATCT TTGGGTTCCT GACCCCGAAC GGCGCCGCGT CCTTCGGCTG GTTCGCCTAC TCGCCGCTGA ACAGCAAGGT GTACTCGCCG GGGGCCGGGT CGGACCTGTG GATCGTGGGT CTCGCGGTCT CCGGTGTCGG TACCATCCTC GGCGCCGTCA ACATGATCAC GACGATCCTG ACGATGCGGG CCCCGGGCAT GACGATGTTC CGGCTGCCGA TCTTCTGCTG GACCTTCCTG GCGACCTCGA TCCTCGTGCT GATCGCCTTC CCCGTGCTCG CCGCGGCGCT GCTCGCCCTG GAGGCCGACC GGCGCTTCGG CGCGCACGTG TTCGACGCCG CCAACGGCGG CGCGCTGCTC TGGCAGCACC TGTTCTGGTA CTTCGGCCAC CCCGAGGTCT ACATCATCGC CCTGCCGTTC TTCGGCGTCA TCAGCGAGAT CCTGCCGGTC TTCTCCCGCA AGCCGCTGTT CGGCTACAAG GGTCTGGTCT TCGCCACCAT CGGCATCGCA GCCCTGTCCG TCGTGGTGTG GGCGCACCAC ATGTTCGTCA CCGGCGCGGT GCTACTGCCC TTCTTCGCGC TGATGTCGTT CCTCATCGCG GTACCGACCG GGATCAAGTT CTTCAACTGG ATCGGCACGA TGTGGCGCGG GCAGCTCACC TTCGAGACGC CGATGCTGTT CGCGATCGGT TTCCTGGTGA CCTTCCTGTT CGGTGGTCTG ACCGGGGTAC TGCTGGCCAG CCCGCCGATC GACTTCCACG TCAGCGACAG CTACTTCGTC GTCGCCCACT TCCACTACGT CGTCTTCGGG ACCGTGGTGT TCGCCGCCTA CGGCGGCACC TACTTCTGGT TCCCGAAGGT CACGGGCCGG CTGATGAACG ACCGGCTCGG GAAGATCCAC TTCTGGACGG TCTTCCTCGG CTTCCACACG ACGTTTCTGG TGCAGCACTG GCTCGGCGTG CAGGGTATGC CCCGCCGGTA CGCCGACTAC GGACCGAACG ACGGGTTCAC CACGCTGAAC ACGATCTCGT CCGCGGGTTC GTTCCTGCTC GCCCTCTCGA CGCTGCCGTT CATCTACAAC CTCTGGCACT CCTACCGCAA GGGCCCACTC GCCGTCGTCG ACGACCCCTG GGGCTACGGG AACTCGCTGG AATGGGCGAC CTCCTGCCCC CCGCCGCGGC ACAACTTCCG GACGCTGCCG CGCATCCGCT CCGAACGCCC GGCGTTCGAC CTGCACTATC CCCAGGCAGC CGGCCGCATC GACTATCATG CGACCCCCGA GATCACACTG ACACCCGAGA CCACGCCGAG GCCCGAGACC GCGGACCCGG CCGAATCCAC AGCGGCCGAA TCCACAGCGG CCGGACCCGA CCGCCCGGCG CCGGAATCCG GCTTCAGGAC GCCGGAATAA
|
Protein sequence | MTILHQPREP GGAADTGTQP ADIVAAHTRP RTPLLGYLRT TSHKDIAILY AVTSFGFFLF AGVLAIMMRA ELARPGLQYF SNEQYNQFFT MHGTLMLLMF ATPLAFAFAN FLVPLQIGAP DVAFPRLNAL SYWFFLFGSL TVIFGFLTPN GAASFGWFAY SPLNSKVYSP GAGSDLWIVG LAVSGVGTIL GAVNMITTIL TMRAPGMTMF RLPIFCWTFL ATSILVLIAF PVLAAALLAL EADRRFGAHV FDAANGGALL WQHLFWYFGH PEVYIIALPF FGVISEILPV FSRKPLFGYK GLVFATIGIA ALSVVVWAHH MFVTGAVLLP FFALMSFLIA VPTGIKFFNW IGTMWRGQLT FETPMLFAIG FLVTFLFGGL TGVLLASPPI DFHVSDSYFV VAHFHYVVFG TVVFAAYGGT YFWFPKVTGR LMNDRLGKIH FWTVFLGFHT TFLVQHWLGV QGMPRRYADY GPNDGFTTLN TISSAGSFLL ALSTLPFIYN LWHSYRKGPL AVVDDPWGYG NSLEWATSCP PPRHNFRTLP RIRSERPAFD LHYPQAAGRI DYHATPEITL TPETTPRPET ADPAESTAAE STAAGPDRPA PESGFRTPE
|
| |