Gene Franean1_6580 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_6580 
Symbol 
ID5674895 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp8005680 
End bp8007245 
Gene Length1566 bp 
Protein Length521 aa 
Translation table11 
GC content71% 
IMG OID641245431 
Productcytochrome bd ubiquinol oxidase subunit I 
Protein accessionYP_001510823 
Protein GI158318315 
COG category[C] Energy production and conversion 
COG ID[COG1271] Cytochrome bd-type quinol oxidase, subunit 1 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTCCATG CCGCCGAAGT TCTCTCCCAC GTCGCGGCTC TCGCCCAGGC CGCGGACCCG 
CCCCAGGCGA GCTCCGTGTC CACCAACCTG GCCCGGGCGC AGACCGCGTT CTCGCTGGCG
TTCCACATCT GCTTCGCGGT GTTCGGCGTC GGCATGCCAT GGCTCCTGCT GTTCGTCGAG
GGGCGCTGGG TCCGCACCCG CGACCCCGTC TGGCTCGCGC TGACCCGCAA GTGGTCCAGG
GCCTTCGCGG TCGTGTTCGC CGTCGGCGCC GTCTCCGGGA CGGCGCTGTC CTTCGAGTTC
GGCCTGCTGT GGCCGGCCTT CATGGCCCGG TACGGCGGGG TGCTGGGACT GTCGTTCACG
CTGGAGGGCT TCGCGTTCTT CGCCGAGGCG ATCTTCATCG GCATGTACCT GTACGGCTGG
AAGCGCCTGT CGGCCCGGGC GCACTGGCTC ACGCTGTGGC CGATCGCCAT CGCCGGCACG
TTCTCGACTC TCTTCATCAT CACGGCGAAC GCCTGGATGA ACACCCCCGG ACACGTCACA
GAGGTCGACG GCAAGGTCGT ATCGGCAGAG CCGTTCGCCC CGTTCCTGGC TGCCACCGCG
CCGCATCAGC TCATCCACAT GCTGCTGGCC GCGCTGATGT GCACCGGCGG CATCGTCGCC
GGCGTGTACG CCGTCGGCAT GCTGCGCGGC CGGCGCGACG CGTATCACCA GCGGGGACTG
CGCGTCGGTC TGGCCGTCCT GCTCGTCTGT GCCCCCCTGC AACTGATCGT CGGTGACTGG
GCCGCCCGGG TGGTCGGCAA CGAGCAGCCG ATCAAGCTCG CCGCGATGGA GGGGCTCGGG
CACACCCGCA CGCACGCCCC GCTGACCATC GGCGGGATCT ACGACGAGCA GACCGGCGAG
GTCCGGCACG GCATCGAGGT CCCCGACCTG CTGTCGCTCA TGCAGGGGTT CAGCGCCGAT
CACGAGATCA CCGGCCTGAC CGCCGTCCCG CCCGAGGAGC GGCCGAACGC CGTGCTGGTG
CACTCGGCCT TCAACGTGAT GGTCGGCCTC GGGATGGCCC TGATCGCGTT GTCGCTGGTC
ACCGGGGTCG TCGTCCTGCG CCGGCGGCGG CGCGGCGAGC GCCCGCTGCT GCCCACCGGA
AAACCGTGGC TCTGGGCGGC CGCGGCCTCG GGACCGGCGT CCATGCTGGC GATGCTCGCT
GGCTGGGAGG TCACCGAGGG CGGCCGCCAG CCCTGGATCG TCTACGGCCG GATGCGGGTG
GATGAGGCCG TGACCAGCAG CTCGGGCATG CCGGCGATCT TCACTGGCAC GATCCTGCTC
TACCTCGGCC TCGCGACGGC GCTCATCCTC ATCCTGCGCC GGATGGCGAC GGGTGGCCCT
GAGCTGGCCG CCACCGTGGG GGAGACCTCC ACGACTCCGG ACATCCCGGA CGCGCGTCCC
GCTCCCGCCT CTCCCGACAC TCCCGCGAGG GTCCGCGGTG ACGGCGACGG TGGCGACGGG
CCCGGGACGT CCGCGCCCAC CACACCGCGA ACCGATCCGT CCAGCAGCGA AGGGGACGCC
CGATGA
 
Protein sequence
MVHAAEVLSH VAALAQAADP PQASSVSTNL ARAQTAFSLA FHICFAVFGV GMPWLLLFVE 
GRWVRTRDPV WLALTRKWSR AFAVVFAVGA VSGTALSFEF GLLWPAFMAR YGGVLGLSFT
LEGFAFFAEA IFIGMYLYGW KRLSARAHWL TLWPIAIAGT FSTLFIITAN AWMNTPGHVT
EVDGKVVSAE PFAPFLAATA PHQLIHMLLA ALMCTGGIVA GVYAVGMLRG RRDAYHQRGL
RVGLAVLLVC APLQLIVGDW AARVVGNEQP IKLAAMEGLG HTRTHAPLTI GGIYDEQTGE
VRHGIEVPDL LSLMQGFSAD HEITGLTAVP PEERPNAVLV HSAFNVMVGL GMALIALSLV
TGVVVLRRRR RGERPLLPTG KPWLWAAAAS GPASMLAMLA GWEVTEGGRQ PWIVYGRMRV
DEAVTSSSGM PAIFTGTILL YLGLATALIL ILRRMATGGP ELAATVGETS TTPDIPDARP
APASPDTPAR VRGDGDGGDG PGTSAPTTPR TDPSSSEGDA R