Gene Franean1_1802 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1802 
Symbol 
ID5670204 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp2164670 
End bp2166376 
Gene Length1707 bp 
Protein Length568 aa 
Translation table11 
GC content66% 
IMG OID641240723 
Productcytochrome c oxidase subunit I type 
Protein accessionYP_001506146 
Protein GI158313638 
COG category[C] Energy production and conversion 
COG ID[COG0843] Heme/copper-type cytochrome/quinol oxidases, subunit 1 
TIGRFAM ID[TIGR02891] cytochrome c oxidase, subunit I 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.115752 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCCTTC TGCACGAGTC GCCCGCCGGG CCGGCCGGGC ACGCCCATGA GCCGGTGAAC 
CACGAGAAGC CGGCGATCGC GAACCTGCTC GGATACCTGA GGACGACGTC CCACAAGGAC
ATCGCGATCA TGTACTTCCT GACGTCATTC GCGTTCTTCG TCATCGCCGG CATCCTCGCG
ATCTTCATCC GTGGCGAGCT GGCCCGCCCG GGGCTGCAGT ACTTCTCCAA CGACCAGTAC
AACCAGTTCT TCACCATGCA CGGCACGCTG ATGCTGCTGA TGTTCGCGAC GCCGCTGGCG
TTCGCGTTCG CCAACTACCT CGTGCCGCTG CAGATCGGCT CGCCGGACGT CGCGTTCCCG
CGGCTGAACG CGCTGTCGTA CTGGCTGTTC CTGTTCGGCA GCCTCACCGT GTTCGCCGGG
TTCCTGAGCC CGGGGGGTGC GGCCTCGTTC GGCTGGTTCG CCTACGCGCC GCTGAGCAAC
CAGCTCTACT CGCCCGGGGT CGGTTCCGAC CTGTGGGTGC TCGGCCTGAC GGTGCAGGGG
CTCGGCACGA TCCTCGGTGC CGTCAACTTC ATCACCACGA TCCTGTGCCT GCGCGCCCCC
GGCATGACGA TGTTCCGGAT GCCGATCTTC TGCTGGAACC TGCTGGTCAC CTCGATCCTG
GTGCTGGTGG CGTTCCCGGT GCTGGCCGCC GCCCTGCTCG CGCTCGCGGC GGACCGAAGG
TTCGGGGCCC ACATCTTCGA CGCCGAGAAC GGCGGCGCGA TGCTGTGGCA GCACCTGTTC
TGGTTCTTCG GGCATCCCGA GGTCTACATC ATCGCGCTGC CGTTCTTCGG GATCGTCACC
GAGATCATCC CGGTGTTCTC CCGCAAGCCG CTGTTCGGCT ACAAGGGCCT GGTGTTCGCC
ACCATCAGCA TCGGGGCCCT GTCCGTCGCG GTGTGGGCGC ACCACATGTT CGTCACCGGC
GCGGTGCTGC TGCCCTTCTT CGCCTTCCTG TCGTTCCTGA TCGCGGTGCC CACCGGCATC
AAGTTCTTCA ACTGGATCGG CACGATGTGG CGCGGGCAGA TCAGCTTCGA GGCGCCGATG
CTGTTCGCGG TCGGCTTCCT GGTGACGTTC CTGTTCGGTG GTCTCACCGG TGTGCTGCTG
GCCAGCCCGC CGATCGACTT CCATGTCAGT GACAGCTACT TCGTGGTCGG ACACTTCCAC
TACGTCGTCG CCGGCCTCAT GTTCGCGGCC TTCGCGGGTG TCTACTTCTG GTTCCCGAAG
GTCACGGGCC GGATGCTGAA CGAGAAGCTG GCGAAGCTGC ACTTCTGGAC GCTGTTCCTG
GGCTTCAACG CCACCTTCCT GGTGTTCCAC TGGCTGGGCA CCCAGGGCAT GCCCCGGCGG
TTCGCGGACT ACGGCCCGAA CGACGGCTTC ACCACCCTGA ACACGGTCGC GACGGCGGGC
TCGTTCCTGA TGGGCCTGTC CACGCTGCCG CTGCTGTACA ACCTGTGGTA CTCGTACCGC
AAGGGCCCGA TCGCCGCGGT CGACGACCCG TGGGGCTACT CCAACTCGCT GGAGTGGGCG
ACGTCCTGCC CGCCGCCCCG GCACAACTTC CGGTCGCTGC CGCGCATCCG CTCCGAGCGA
CCCGCGTTTG ACCTGCACTA CCCGGCGGTC GTCGGTAGGG CGGATTACCA CGCCACACCG
GAACTCGGTC AGGGAGCCGC GCGATGA
 
Protein sequence
MTLLHESPAG PAGHAHEPVN HEKPAIANLL GYLRTTSHKD IAIMYFLTSF AFFVIAGILA 
IFIRGELARP GLQYFSNDQY NQFFTMHGTL MLLMFATPLA FAFANYLVPL QIGSPDVAFP
RLNALSYWLF LFGSLTVFAG FLSPGGAASF GWFAYAPLSN QLYSPGVGSD LWVLGLTVQG
LGTILGAVNF ITTILCLRAP GMTMFRMPIF CWNLLVTSIL VLVAFPVLAA ALLALAADRR
FGAHIFDAEN GGAMLWQHLF WFFGHPEVYI IALPFFGIVT EIIPVFSRKP LFGYKGLVFA
TISIGALSVA VWAHHMFVTG AVLLPFFAFL SFLIAVPTGI KFFNWIGTMW RGQISFEAPM
LFAVGFLVTF LFGGLTGVLL ASPPIDFHVS DSYFVVGHFH YVVAGLMFAA FAGVYFWFPK
VTGRMLNEKL AKLHFWTLFL GFNATFLVFH WLGTQGMPRR FADYGPNDGF TTLNTVATAG
SFLMGLSTLP LLYNLWYSYR KGPIAAVDDP WGYSNSLEWA TSCPPPRHNF RSLPRIRSER
PAFDLHYPAV VGRADYHATP ELGQGAAR