Gene Franean1_0895 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0895 
Symbol 
ID5669309 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp1044613 
End bp1046298 
Gene Length1686 bp 
Protein Length561 aa 
Translation table11 
GC content65% 
IMG OID641239822 
Productcytochrome c oxidase subunit I type 
Protein accessionYP_001505257 
Protein GI158312749 
COG category[C] Energy production and conversion 
COG ID[COG0843] Heme/copper-type cytochrome/quinol oxidases, subunit 1 
TIGRFAM ID[TIGR02891] cytochrome c oxidase, subunit I 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.822333 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.123512 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACTATTC TGCGCGAGCC GTCCGGCCAC GCCGTCGAAC ACGCGGAAGC CGGACACTCC 
CGGCCACGCA CGAACATGCT AGGATATCTT CGCACCACCT CGCACAAGGA TATCGCCGTC
CTGTACGCGG TGACGTCGTT CGGGTTCTTC ATCCTCGCCG GGATCCTGGC CATGATGATG
CGTGCCGAGC TGGCACGGCC GGGTCTGCAG TACTTCTCGA ACGAGCAGTA CAACCAGTTT
TTCACCCTGC ACGGCACGCT CATGCTGCTG CTGTTCGCGA CGCCGCTGGC GTTCGCCTTC
GCGAACTTCC TCATACCGCT GCAGATCGGG TCGCCGGACG TCGCGTTCCC CCGGCTCAAC
GCCCTTTCGT ACTGGTTCTT CCTGTTCGGC GGGCTGATGG TCGTCGCCGG CTTCCTCACC
CCGGACGGCG CCGCGGACTT CGGCTGGTTC GCCTACGCCC CACTGAACAA CAAGACGTTC
AGCCCGTCGG TCGGGGCGGA CATGTGGATC CTGGGCCTCG TCGTCTCCGG GCTCGGGACG
ATCCTCGGCG CGGTCAACAT GATCACCACG ATACTCACCC TGCGCGGCCC CGGTATGACG
ATGTTCCGCC TGCCGATCTT CTGCTGGACG TTCCTCGTGA CGTCCGTGCT GGTGATCGTC
GCGTTCCCGG TGCTGGCTGG GGCCCTGCTG TCGCTGGAGG CCGACCGGCG CTTCGGCGCC
CACGTGTTCG ACTCGGAGAA CGGCGGCGCC ATCCTCTGGC AGCACCTGTT CTGGTTCTTC
GGGCATCCCG AGGTCTACAT CATCGCCCTG CCGTTCTTCG GCATCATCAG CGAGATCATC
CCGGTCTTCT CCCGGAAGCC GGTCTTCGGC TACAAGGGCC TGGTGTTCGC CACCATCGCC
ATCGGCGCCC TGTCGATCGT GGTCTGGGCA CACCACATGT TCGTCACCGG CGCGGTACTG
CTGCCCTTCT TCGCCGTGAT GTCGTTCCTG ATCGCCGTCC CGACCGGTAT CAAGTTCTTC
AACTGGATCG GGACGATGTG GCGGGGCAAG CTGAGCTTCG AGACCCCGAT GATGTTCTGC
CTCGGGTTCC TCGTGACGTT CCTCCTCGGC GGGCTGACCG GGGTGATGCT CGCGAGCCCG
CCGATCGACT TCCACGTCAG CGACAGCTAC TTCGTGGTCG CCCACTTCCA CTACGTGGTG
TTCGGGACGG TCGTCTTCGC GGCGTTCGCC GGCACGTACT TCTGGTTCCC CAAGCTGACC
GGCCGGATGA TGGACGACCG GCTCGGCAAG ATCCACTTCT GGACCGTCTT CCTGGGCTTC
CACCTGACGT TCCTGGTGCA GCACTACCTG GGCATGCAGG GGATGCCCAG GCGGTACGCC
GACTACGGGC CGGGCGACGG ATTCACCACG TTGAACACGA TATCGACCGC TGGCAGCTTC
CTGCTCGGGG TCTCGACACT GCCGTTCATG TACAACGTGT GGAATTCCTA CCGCCGCGGC
CGGCTCGCCG TCGTCGACGA CCCGTGGGGG TACGGGAACT CCCTCGAGTG GGCGACGTCC
TCCCCGCCGC CCCGGCACAA CTTCCACCAG CTGCCGCGCA TCCGCTCCGA GCGCCCCGCC
TTCGACCTGC ACTACCCGGA GGTCGCCGGC GTCACCGACT ACCACGCCAC CCCCGAACTG
CGCTAG
 
Protein sequence
MTILREPSGH AVEHAEAGHS RPRTNMLGYL RTTSHKDIAV LYAVTSFGFF ILAGILAMMM 
RAELARPGLQ YFSNEQYNQF FTLHGTLMLL LFATPLAFAF ANFLIPLQIG SPDVAFPRLN
ALSYWFFLFG GLMVVAGFLT PDGAADFGWF AYAPLNNKTF SPSVGADMWI LGLVVSGLGT
ILGAVNMITT ILTLRGPGMT MFRLPIFCWT FLVTSVLVIV AFPVLAGALL SLEADRRFGA
HVFDSENGGA ILWQHLFWFF GHPEVYIIAL PFFGIISEII PVFSRKPVFG YKGLVFATIA
IGALSIVVWA HHMFVTGAVL LPFFAVMSFL IAVPTGIKFF NWIGTMWRGK LSFETPMMFC
LGFLVTFLLG GLTGVMLASP PIDFHVSDSY FVVAHFHYVV FGTVVFAAFA GTYFWFPKLT
GRMMDDRLGK IHFWTVFLGF HLTFLVQHYL GMQGMPRRYA DYGPGDGFTT LNTISTAGSF
LLGVSTLPFM YNVWNSYRRG RLAVVDDPWG YGNSLEWATS SPPPRHNFHQ LPRIRSERPA
FDLHYPEVAG VTDYHATPEL R