Gene Franean1_2018 details

Gene Information

Locus tag: Franean1_2018
Symbol:
ID: 5670419
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 2424486
End bp: 2425922
Gene Length: 1437 bp
Protein Length: 478 aa
Translation table: 11
GC content: 76%
IMG OID: 641240939
Product: FAD linked oxidase domain-containing protein
Protein accession: YP_001506361
Protein GI: 158313853
COG category: [C] Energy production and conversion
COG ID: [COG0277] FAD/FMN-containing dehydrogenases
TIGRFAM ID: [TIGR00387] glycolate oxidase, subunit GlcD
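
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record: it assumes the start and end positions are 1-based and inclusive, and that the listed gene length includes the stop codon. The function names `span_length` and `expected_protein_length` are hypothetical helpers introduced only for illustration.

```python
# Values copied from the Gene Information fields above.
START_BP = 2424486
END_BP = 2425922
GENE_LENGTH_BP = 1437
PROTEIN_LENGTH_AA = 478


def span_length(start: int, end: int) -> int:
    """Length of a region, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming the length includes one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both checks hold for this record under the stated assumptions.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions, the 1437 bp span corresponds to 479 codons, one of which is the stop codon, which agrees with the listed protein length of 478 aa.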


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.796583
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.304536
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGACCGGCG GGACGGCGGG CGCGGCGGCC GCGGCGGCCG CGGGCGCGGC GTTCGCCTCG 
GCGCTGCCGG CCGGGCGGTG GAGCCAGGAC GGCGCGGTCG TCGACGCCCA CCGGCTCGAC
CGCTCCGGGT GGCGGCCGCC GGGACGTCCC CTCGGCGTGG CGTTCGCCGG CGGCGTCGAC
GACGTGCGGG CCGTCCTGCG CACGGCGCTC GCGTCGGGCA CCCCGGTGAC GGTCCGGGGC
GCCGGGACGG GGCTGGCCGG CGGCGCGGCC GCCCCGGACG GCGCGGTGAT CCTCGACGTC
AGCGGCATGA ACCGCATCCG CGAGCTGTCG GTGCCCGATG CCGTCGCCGT CGTCGAGCCC
GGGGTGATCA CCGACGATCT CGACCGGGCC GCCCGGGAGG TCGGGCTGAG CTACTCCCCC
GACCCGGCCA GCTCGGCGAT CTCCACGATC GGCGGGAACA TCGCGACCAA CGCGGGTGGC
CTGCGGTGCG CCAAGTACGG CGTGACCCGG GAGTCCGTGC TCGGCCTCGA CGTGGTCCTC
GCTGACGGGG AGCTGGTCAG CACCGGGCGG CGCACTGTGA AGGGCGTCGC CGGCTACGAC
CTGACCGGCC TGTTCGTCGG TTCGGAGGGC ACCCTCGGCG TCGTCGTCGG CGCGACGCTG
CGGCTGCGGC CGGCGCCGCG GCGAACGGTC ACCCTCGCTG CCTTCTTCGA CTCCTTCGGC
GCCGCGGTGG ACGCGGTCAC CGCGATCATG GCCACCGGAA TCGTGGTGGC CATGGCGGAG
CTGCTCGACG GGCCGACCGT GCGGGCCGTG GACGCGGCGA CCGGCGGCGA TCTCGCCGAC
GCCGGCCAGG CCCTCCTGCT CGTCCAGACC GACGGCGCCG GAGCCGACGA CGAGGCCGAC
GCCGTCGAGG CGGTGCTGCG CGGGCCGGCC CGCGCCGTGC GCCGCGCGGC GGACCCGGCG
GCCGCGGCCG AACTGCTCGC GGCCCGCCGG GCGGCCCTGC CATCCCTCGA ACGGATCGGC
CGGGTTCTGA TCGAGGACAT CGCCGTGCCC CGCTCCCAGC TGGCGCGGGC GGCAGCGCGG
ATCACCGAGA TCAGCGCCGC CACCGGTGTG CGGATCTTCA CCATCGCGCA CGCGGCCGAC
GGAAACCTGC ACCCGATCAT CGTCGTGGAC GGCTCCGACA GGCTGAACGG AACTGACGGC
ACCGACGGGC CGGACAGGGC CGCCGACGAG ATCCCCGCCG ACGTCTGGAA GGCCGCCGAC
CTCATCTTCC AGACCGCGCT GGACCTGGGC GGCACAGTCA CCGGTGAGCA CGGGATCGGC
GCCCTCAAGC GTCGCTGGCT CGGCGCGGAG CTCGGGACGG CGAACCACTC CCTGCAGCAG
CGTCTGCGGC ACCTGTTCGA CCCGACCGGG ATCCTGTCCC CCGGCCGCGG CCTGTGA
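
The gene sequence above is printed in blocks of ten bases separated by spaces, so it has to be normalized before any calculation. The sketch below shows one way to strip that layout and compute a GC fraction that could be compared with the "GC content" field. The function name `gc_content` is a hypothetical helper, and the sketch assumes the input contains only the letters A, C, G, and T plus whitespace.

```python
def gc_content(seq_text: str) -> float:
    """Fraction of G and C bases in a sequence, ignoring spaces and newlines."""
    # Remove the block spacing and line breaks used in the listing.
    bases = "".join(seq_text.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


if __name__ == "__main__":
    # Toy input in the same block layout; paste the full listing to check the record.
    print(gc_content("GGCC AATT"))
```

Multiplying the result by 100 gives a percentage comparable to the value reported in the record.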
 
Protein sequence
MTGGTAGAAA AAAAGAAFAS ALPAGRWSQD GAVVDAHRLD RSGWRPPGRP LGVAFAGGVD 
DVRAVLRTAL ASGTPVTVRG AGTGLAGGAA APDGAVILDV SGMNRIRELS VPDAVAVVEP
GVITDDLDRA AREVGLSYSP DPASSAISTI GGNIATNAGG LRCAKYGVTR ESVLGLDVVL
ADGELVSTGR RTVKGVAGYD LTGLFVGSEG TLGVVVGATL RLRPAPRRTV TLAAFFDSFG
AAVDAVTAIM ATGIVVAMAE LLDGPTVRAV DAATGGDLAD AGQALLLVQT DGAGADDEAD
AVEAVLRGPA RAVRRAADPA AAAELLAARR AALPSLERIG RVLIEDIAVP RSQLARAAAR
ITEISAATGV RIFTIAHAAD GNLHPIIVVD GSDRLNGTDG TDGPDRAADE IPADVWKAAD
LIFQTALDLG GTVTGEHGIG ALKRRWLGAE LGTANHSLQQ RLRHLFDPTG ILSPGRGL