Gene Francci3_2493 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_2493 
Symbol 
ID3904871 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp2939930 
End bp2941747 
Gene Length1818 bp 
Protein Length605 aa 
Translation table11 
GC content64% 
IMG OID637879823 
Productcytochrome-c oxidase 
Protein accessionYP_481589 
Protein GI86741189 
COG category[C] Energy production and conversion 
COG ID[COG0843] Heme/copper-type cytochrome/quinol oxidases, subunit 1 
TIGRFAM ID[TIGR02891] cytochrome c oxidase, subunit I 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.526922 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.741402 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGAGGAG GAACTCGTGC CCGACGTGGC CGTGTCTGGT CGGATGGCCC GATGACGGTC 
GAGGTCCGTC CTTTTCCGCC GAGTGTCGGG GAGCAGCGAG GGGAGCCGCC GACGGCCCTG
GACATCCTCG GTACGACCGA TCACAAGTCG ATCGGTCTGA TGTATCTGGT GACCTCGTTC
GGGTTCTTCC TATTCGCCGG GATCCTCGCG ATGCTAATGC GGGGGGAACT GGCCCGGCCC
GGTCTACAGA TCGTCTCGAA CGAGCGGTAC AACCAGTTTT TTACGATCCA CGGCACGGTG
ATGATGCTGT TGTTCGCGAC CCCGCTGTTC TCCGCGTTCG CGAACTATCT TGTCCCGTTG
CAGATCGGGG CGCCTGACAT CGCCTTCCCC CGGTTGAACG CCTTGGGCTA CTGGCTGTAT
CTGGTCGGCG GTCTGATCGT CGTCTCCGGG TTCCTGACCC GGGGCGGGGC GGCGGACTTC
GGCTGGTTCG CGTACTCCCC GTTGACCTCG GCGCAGTACT CGCCGCACCT CGGCGGTGAT
CTGTGGGCGG TGGGCCTGGT GATGTCCGGG CTGAGCACCG TGCTGGCGAG CGTCAACTTC
ATCACCACGA TCATCACGCT GCGCGCGCCG GGCATGCTCC TGTTCCGAAT GCCGATCTTC
ACCTGGAACG TGCTGATTAC GTCGGTGCTG GCGCTGCTCG CGTTCCCGGT ACTCACCGCG
GCCCTGCTGG CGCTGGAGGC CGATCGCCGG TTCGGCGCCC ACGTGTACGA CGCGAGCAAC
GGAGGAGCGA TCCTCTGGCA GCACCTTTTC TGGTTCTTCG GTCATCCCGA GGTCTACATC
GTCGCGCTGC CCTTCTTCGG CATCGTCACG GAGATCCTGC CGGTGTTCAG CCGTAAACCG
ATTTTCGGGT ACCGGGGGTT GGTCTTCGCG ACCGTGGGAA TCGGCGCGCT GTCGATGGCG
GTGTGGGCGC ACCACATGTT CGCGACCGGT GCCGTGCTGC TGCCGTTCTT CTCTCTGCTG
TCCTTTCTGA TCGCCGTCCC GACCGGGGTG AAGTTCTTCA ACTGGATCGG CACCATGTGG
CGCGGAACGA TCATCCTGAA CACGCCAATG CTGTTCTCGA TCGGATTCCT TATCACCTTC
CTGTTCGGTG GGCTGACCGG GGTCACCCTG GCCAGCCCAC CCATCGACTT TCACGTCACC
GACAGCTATT ACGTCGTGGG TCATCTGCAC TATGTCCTGT TCGGGACAAT CGTCTTCTCC
GCGTTCGCCG GTGTGTACTT CTGGTTCCCC AAGGTGACGG GAAGGTTCCT GAACGAGAAG
TTGGGGAAGC TGCACTTCTG GAGTCTTCTC GTCGGTTTCC ACGTGACGTT TCTGCCGATG
CACTGGCTGG GTGTCAGGGG AATGCCGCGG CGGGTCGCCG ACTACCTGCC CACCGACGGT
TTCACCACGC TGAACACGAT CGAGAGTGCC GGCTCCTTCC TGCTCGGCCT GTCCACCCTA
CCGTTTCTGT ACAATGTTTG GCGGTCGCTG CGGCATGGTG ACCCGGCGAC CAGTGGTGAC
GCCTGGGGCT ACGGGAACTC GTTGGAATGG GCGACGTCCT GCCCACCACC GAAGCACAAC
TTTCACCAGA TACCCCGTAT CCGATCCGAG CGGCCGGCCT TCGACCTGAA CCACCCCCGG
CAGGGGCCGG AGGGGCGGGA AGGTCGCCAG CGGCAGCCGC GCCGGGTCGA CCAGAGGCAC
CGGCGCGAGC TGGCCTCCGT CGAGGGGGAG ACGGCCTCGC GGCGTGATAC CGCGGTGGAC
CCGGACGCGG CCGGGTGA
 
Protein sequence
MRGGTRARRG RVWSDGPMTV EVRPFPPSVG EQRGEPPTAL DILGTTDHKS IGLMYLVTSF 
GFFLFAGILA MLMRGELARP GLQIVSNERY NQFFTIHGTV MMLLFATPLF SAFANYLVPL
QIGAPDIAFP RLNALGYWLY LVGGLIVVSG FLTRGGAADF GWFAYSPLTS AQYSPHLGGD
LWAVGLVMSG LSTVLASVNF ITTIITLRAP GMLLFRMPIF TWNVLITSVL ALLAFPVLTA
ALLALEADRR FGAHVYDASN GGAILWQHLF WFFGHPEVYI VALPFFGIVT EILPVFSRKP
IFGYRGLVFA TVGIGALSMA VWAHHMFATG AVLLPFFSLL SFLIAVPTGV KFFNWIGTMW
RGTIILNTPM LFSIGFLITF LFGGLTGVTL ASPPIDFHVT DSYYVVGHLH YVLFGTIVFS
AFAGVYFWFP KVTGRFLNEK LGKLHFWSLL VGFHVTFLPM HWLGVRGMPR RVADYLPTDG
FTTLNTIESA GSFLLGLSTL PFLYNVWRSL RHGDPATSGD AWGYGNSLEW ATSCPPPKHN
FHQIPRIRSE RPAFDLNHPR QGPEGREGRQ RQPRRVDQRH RRELASVEGE TASRRDTAVD
PDAAG