Gene Franean1_4816 details

Gene Information

Locus tag: Franean1_4816
Symbol:
ID: 5673157
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 5749661
End bp: 5751199
Gene Length: 1539 bp
Protein Length: 512 aa
Translation table: 11
GC content: 68%
IMG OID: 641243672
Product: Bilirubin oxidase
Protein accession: YP_001509088
Protein GI: 158316580
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG2132] Putative multicopper oxidases
TIGRFAM ID:
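
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only. It assumes the Start bp and End bp values are 1-based and inclusive (the record does not say so explicitly) and that the gene length includes the terminal stop codon, which matches the TGA at the end of the gene sequence in this record. The variable names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 5749661
end_bp = 5751199
gene_length_bp = 1539
protein_length_aa = 512

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
span = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons.
# Assumption: the last codon is the stop codon and is not counted in the protein.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_length = codons - 1

print(span, remainder, expected_protein_length)  # 1539 0 512
```

Under these assumptions the span equals the reported gene length, and 513 codons minus one stop codon gives the reported 512 aa.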


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.237697
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.101949
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTCGTC TGCTGAGCCG ACGAAACGTG CTGGGCGCCT GGGTCGCCGC GACCGCGGCG 
GGCACCGTCC GGCTGGCCAC GGCCGGCGGC GGCGGTGACG GCGGTGCCTC GCTGGCCTCC
GCCAGTGCGT CAACCGACCA CGCAGCCCAT CTCGGTCACT CCGGAACCGT CGAGACGGTC
GCGAACGAGG TGACCGCGGC CCCGCCGGTG ACGCCGTTCT CCGTTCCGCT GCCGGTCCCG
CCGACGCTCG CGCCGACGTC CACAACCGCG GACACGGACT TCTACGAGAT AACCAGCCGG
ACCGCGGCGG AGGAGATCCT GCCCGGTTTC TCCACCGAGA TCCACGGCTA CGGCGGGAGC
TATGTCGGGC CGACGATAAA GGCGAAGGTC GGCCGCAAGG TTGTTGTCCG GCACACCAAC
CAGCTGCCCC ACCCGACCGC CGTCCACCTG CACGGGGCAC ATGTCCCGGC GTCCAGCGAC
GGCTTTCCGA CCGATGTCAT CGACACCGGT GCGACGAAGA CCTACGAGTA CCCGAACAAC
CAGCCGGCCG CGACCCTGTG GTATCACGAC CATGCGCATC ATCACGAGAC CGAGAACATC
TACCGGGGCC TGCACGGCTT CTACCTGCTG GAGGATCCGG CCGAGGCGGC ACTGAACCTT
CCGAGCGGGG AGTTCGACAT TCCCGTCCTG CTGTCCGACG CCAACATCGA CGCCGACGGC
CAGCTCGTCT TCATCAGCCT GGATGAGTAC GGACGGACGA CGCTGCTGAC GAACGGCCGC
CCCGCCCCGT ACTTTCAGGT GCAGGCCCGC CGCTACCGGT TCCGGTTCCT GAACGCCAGC
AACATGCGGG TGTTCAACCT GGTACTGGGC CGCCAGGAGA GGTTCACTCA GATCGCGGGG
GACGGCGGCC TGCTGCCGGC GCCGGCGACC GTCGACAAGG TCCTGCTCAG TGCCGGTGAG
CGGGCCGAGA TCGTCATCGA CTTCTCCAGG TACGCGCCCG GCACCCAGCT CGTCCTGAAC
AACACCCGGA ACCTCAACGA TCCGAGCCGG CCGGTCGCCC TGCCCGTGCT GCGCTTCGAC
GTGGTCGCCC GGGTGGGGCA GGACACCAGC GCCGTCCCGG CCACCCTGCG AGCGCATCCG
CCGCTGCCCG CCGCGACCCG GGTCCGGGAC GTGGTCCTCT CGTACCACAC CGCGGCCTTC
CCGTTCGCGA TCAACGGGAT GCTGTTCGAC CCCAACCGCG TCGACTACAC CATTCGCCGC
GGCAGCACCG AGATCTGGCG GATCACCAAC CCGGTCACCC CGGTGGCGGT CTACCACAAC
CTGCACCTGC ATCTGGTGCA GTTCCGGATC CTGGACCGCG ACGGCGTACC GCCCGGTCCG
GCCGAGTCGG GCTGGAAGGA CACCGTGTTC GTGGCGCCGG GCGAGACGGT CCGCATCCAG
GCGACGTTCA CCGAGTATCT CGGTAAGTAC CTCTACCACT GCCACGTCCA GGATCACGCC
GGCACCGGGA TGATGGCACT GTTCGAGACT GTGGCCTGA
 
Protein sequence
MSRLLSRRNV LGAWVAATAA GTVRLATAGG GGDGGASLAS ASASTDHAAH LGHSGTVETV 
ANEVTAAPPV TPFSVPLPVP PTLAPTSTTA DTDFYEITSR TAAEEILPGF STEIHGYGGS
YVGPTIKAKV GRKVVVRHTN QLPHPTAVHL HGAHVPASSD GFPTDVIDTG ATKTYEYPNN
QPAATLWYHD HAHHHETENI YRGLHGFYLL EDPAEAALNL PSGEFDIPVL LSDANIDADG
QLVFISLDEY GRTTLLTNGR PAPYFQVQAR RYRFRFLNAS NMRVFNLVLG RQERFTQIAG
DGGLLPAPAT VDKVLLSAGE RAEIVIDFSR YAPGTQLVLN NTRNLNDPSR PVALPVLRFD
VVARVGQDTS AVPATLRAHP PLPAATRVRD VVLSYHTAAF PFAINGMLFD PNRVDYTIRR
GSTEIWRITN PVTPVAVYHN LHLHLVQFRI LDRDGVPPGP AESGWKDTVF VAPGETVRIQ
ATFTEYLGKY LYHCHVQDHA GTGMMALFET VA
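
The gene and protein listings above are printed in groups of 10 characters, so they need to be cleaned before any computation. The following is a minimal sketch, not the database's own method, of how one might strip the grouping, translate the DNA, and compute a GC percentage. The function names `clean`, `translate`, and `gc_percent` are hypothetical. The codon table is the standard one. Translation table 11 uses the same codon-to-amino-acid assignments and differs in the start codons it permits, which does not matter here because this gene starts with ATG.

```python
# Standard genetic code in the conventional T, C, A, G ordering.
# Assumption: table 11 shares these amino acid assignments; alternative
# start codons (for example GTG or TTG) are not handled in this sketch.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq_text):
    """Remove spaces and line breaks from a grouped sequence listing."""
    return "".join(seq_text.split()).upper()


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of bases that are G or C."""
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases and last 9 bases of the gene sequence in this record.
print(translate(clean("ATGAGTCGTC TGCTGAGCCG ACGAAACGTG")))  # MSRLLSRRNV
print(translate(clean("GTGGCCTGA")))  # VA
```

The two printed fragments match the first ten and last two residues of the protein sequence listed above. Passing the full cleaned gene sequence to `translate` and `gc_percent` would allow a comparison against the Protein Length and GC content fields.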