Gene Francci3_3821 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3821 
Symbol 
ID3905569 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp4580534 
End bp4582363 
Gene Length1830 bp 
Protein Length609 aa 
Translation table11 
GC content66% 
IMG OID637881147 
Productcytochrome-c oxidase 
Protein accessionYP_482900 
Protein GI86742500 
COG category[C] Energy production and conversion 
COG ID[COG0843] Heme/copper-type cytochrome/quinol oxidases, subunit 1 
TIGRFAM ID[TIGR02891] cytochrome c oxidase, subunit I 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00155592 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0385819 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACGATTC TACACCAACC GAGAGAACCG GGCGGAGCGG CGGATACCGG GACGCAGCCA 
GCGGATATCG TCGCCGCGCA CACCAGGCCA CGTACCCCTT TGCTCGGCTA TCTGCGGACG
ACGTCCCACA AGGACATCGC CATCCTGTAC GCCGTCACGT CGTTCGGGTT CTTCCTCTTC
GCCGGTGTCC TGGCCATCAT GATGCGGGCG GAACTCGCCC GCCCGGGGCT GCAGTACTTC
TCCAACGAGC AGTACAACCA GTTTTTCACG ATGCACGGCA CGCTGATGCT GCTCATGTTC
GCGACCCCGC TGGCGTTCGC GTTCGCGAAC TTCCTCGTGC CGCTGCAGAT CGGCGCGCCG
GACGTGGCGT TCCCGCGGCT GAACGCGCTG TCCTACTGGT TCTTCCTGTT CGGCAGCCTG
ACGGTGATCT TTGGGTTCCT GACCCCGAAC GGCGCCGCGT CCTTCGGCTG GTTCGCCTAC
TCGCCGCTGA ACAGCAAGGT GTACTCGCCG GGGGCCGGGT CGGACCTGTG GATCGTGGGT
CTCGCGGTCT CCGGTGTCGG TACCATCCTC GGCGCCGTCA ACATGATCAC GACGATCCTG
ACGATGCGGG CCCCGGGCAT GACGATGTTC CGGCTGCCGA TCTTCTGCTG GACCTTCCTG
GCGACCTCGA TCCTCGTGCT GATCGCCTTC CCCGTGCTCG CCGCGGCGCT GCTCGCCCTG
GAGGCCGACC GGCGCTTCGG CGCGCACGTG TTCGACGCCG CCAACGGCGG CGCGCTGCTC
TGGCAGCACC TGTTCTGGTA CTTCGGCCAC CCCGAGGTCT ACATCATCGC CCTGCCGTTC
TTCGGCGTCA TCAGCGAGAT CCTGCCGGTC TTCTCCCGCA AGCCGCTGTT CGGCTACAAG
GGTCTGGTCT TCGCCACCAT CGGCATCGCA GCCCTGTCCG TCGTGGTGTG GGCGCACCAC
ATGTTCGTCA CCGGCGCGGT GCTACTGCCC TTCTTCGCGC TGATGTCGTT CCTCATCGCG
GTACCGACCG GGATCAAGTT CTTCAACTGG ATCGGCACGA TGTGGCGCGG GCAGCTCACC
TTCGAGACGC CGATGCTGTT CGCGATCGGT TTCCTGGTGA CCTTCCTGTT CGGTGGTCTG
ACCGGGGTAC TGCTGGCCAG CCCGCCGATC GACTTCCACG TCAGCGACAG CTACTTCGTC
GTCGCCCACT TCCACTACGT CGTCTTCGGG ACCGTGGTGT TCGCCGCCTA CGGCGGCACC
TACTTCTGGT TCCCGAAGGT CACGGGCCGG CTGATGAACG ACCGGCTCGG GAAGATCCAC
TTCTGGACGG TCTTCCTCGG CTTCCACACG ACGTTTCTGG TGCAGCACTG GCTCGGCGTG
CAGGGTATGC CCCGCCGGTA CGCCGACTAC GGACCGAACG ACGGGTTCAC CACGCTGAAC
ACGATCTCGT CCGCGGGTTC GTTCCTGCTC GCCCTCTCGA CGCTGCCGTT CATCTACAAC
CTCTGGCACT CCTACCGCAA GGGCCCACTC GCCGTCGTCG ACGACCCCTG GGGCTACGGG
AACTCGCTGG AATGGGCGAC CTCCTGCCCC CCGCCGCGGC ACAACTTCCG GACGCTGCCG
CGCATCCGCT CCGAACGCCC GGCGTTCGAC CTGCACTATC CCCAGGCAGC CGGCCGCATC
GACTATCATG CGACCCCCGA GATCACACTG ACACCCGAGA CCACGCCGAG GCCCGAGACC
GCGGACCCGG CCGAATCCAC AGCGGCCGAA TCCACAGCGG CCGGACCCGA CCGCCCGGCG
CCGGAATCCG GCTTCAGGAC GCCGGAATAA
 
Protein sequence
MTILHQPREP GGAADTGTQP ADIVAAHTRP RTPLLGYLRT TSHKDIAILY AVTSFGFFLF 
AGVLAIMMRA ELARPGLQYF SNEQYNQFFT MHGTLMLLMF ATPLAFAFAN FLVPLQIGAP
DVAFPRLNAL SYWFFLFGSL TVIFGFLTPN GAASFGWFAY SPLNSKVYSP GAGSDLWIVG
LAVSGVGTIL GAVNMITTIL TMRAPGMTMF RLPIFCWTFL ATSILVLIAF PVLAAALLAL
EADRRFGAHV FDAANGGALL WQHLFWYFGH PEVYIIALPF FGVISEILPV FSRKPLFGYK
GLVFATIGIA ALSVVVWAHH MFVTGAVLLP FFALMSFLIA VPTGIKFFNW IGTMWRGQLT
FETPMLFAIG FLVTFLFGGL TGVLLASPPI DFHVSDSYFV VAHFHYVVFG TVVFAAYGGT
YFWFPKVTGR LMNDRLGKIH FWTVFLGFHT TFLVQHWLGV QGMPRRYADY GPNDGFTTLN
TISSAGSFLL ALSTLPFIYN LWHSYRKGPL AVVDDPWGYG NSLEWATSCP PPRHNFRTLP
RIRSERPAFD LHYPQAAGRI DYHATPEITL TPETTPRPET ADPAESTAAE STAAGPDRPA
PESGFRTPE