Gene Franean1_4056 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4056 
Symbol 
ID5672414 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4835802 
End bp4837544 
Gene Length1743 bp 
Protein Length580 aa 
Translation table11 
GC content77% 
IMG OID641242932 
ProductFAD dependent oxidoreductase 
Protein accessionYP_001508349 
Protein GI158315841 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1233] Phytoene dehydrogenase and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.194203 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAGCGG ACGCGGTCGT CATCGGCGCG GGGGTGAACG GCCTCGTCGC GGCGAACCGG 
CTCGCCGACG CCGGCTGGGA CGTCGTCGTG TGCGAGGCCG CCGACGAGCC GGGCGGGGCC
TGCCGCTCCG CCGAGGTCAC CGCGCCCGGG TTCGGTACCG ACCTGTTCAG CGCCTTCTAC
CCGTTCGCCG CGCGGTCCCC GGCGCTGCGC GCGCTCGACC TCACCGACCA CGGGCTGACC
TGGCTGCACG CCCCGCGGGT GCTGGCCCAC CCCACCCCGG ACGGCCGCTG CGCGGTGCTG
TCCACCGACC TCGACGGGAC GGCCGCATCC CTCGCCGCCT ACGCCCCGGC CGACGCGGCG
GCCTGGCGGG CGCAGGTCGC ACTGTGGGAA CGGGTCCGCG ACCCGCTGCT CGAGGCGCTG
CTGGCCGCCC CGTTCCCGCC GGTCCGGGCG GGCGCACGGC TCGCCCGCGC GCTCGGGGCC
GCCGACGCGC TGCGCTTCGC CCGGTTCGCG ATGCTGCCCG TCCGCCGGTT CGCCGCGGAG
GAATTCGCCG GCGCGGGCGC CGGACTGCTG GTCGCCGGCA GCGCGCTGCA CACCGACCTG
GCGCCGGAGT CGGCGGGATC GGCGCTGATC GGCTGGCTGC TGGCGATGCT CGGCCAGGAT
GTCGGCTTCC CGGTGCCGCG TGGCGGCGCG GGCCGGCTGA CCGCCGCCCT CGTCGACCGG
CTGCGCTCCC GCGGCGGGGT GGTGCGCACT CGAGCCGAGG TGGACGCGGT GATCGTCACC
GCCGGCCGGG CCCGTGGCGT GCGGCTCACC GACGGGACGG CCGTGCGCGC CCGGCGCGCG
GTGCTCGCCG ACGTGGACGC GGTCTCCCTC TATCGGCGCC TGGTCGGCGA CGAGCATCTC
CCCGCCCGCC TGCTCACCGA CCTGGCCCGC TTCCAGTGGG ACAGCTCGAC CTTCAAGGTC
AACTGGGCGC TCGCCGGGCC GATCCCGTGG TCCGACCGCC GGATCGCGGA CGCCGGCACC
GTCCACCTCG GCGGCACCAT GGACGATCTG ACGATGATGT CCGCGCAGCT GGCGTGCGGG
CTCGTGCCGG CGGACCCGTT CCTCGTCCTC GGCCAGATGA CCACTGCCGA CCCGGGCCGC
TCGCCGGCCG GGACGCAGAG CGCGTGGGCG TACTTCCACC TCCCGCAGTC CCCCCGGGGC
GACGCCGGTG GGGCCGGTGT CACCGGCCGC TGGGACGGCG ACGACACCGC CCGCCTCCTC
GAGCGGGTGG AACGCAAGCT GGAGGCGGCG GCCCCCGGCT TCGGGTCGCT GATTCTCAGC
CGCGACACCC AGTCGCCGCG ACGGCTGGAG GACCAGGACG CCGTGCTGCG CGGCGGCGCC
CTCAACGGCG GGACGGCCGC CCTGCACCAG CAGCTGATCT TCCGCCCGGT GCCCGGCCTG
GGCCGCCCGG AGACCCCGAT CCCCGGGCTC TATCTGGCGT CGATGTCAGC ACATCCCGGC
GGCGGTGCGC ACGGCGGGCC GGGCGCCATG GCGGCCACGG TGGCGCTGCG CGACGCCGGC
CCGGCCGGCC CCGTCCGCCG CCGCGCGTCC GCCGCCGCGC ACCGGCTGAT CTACGCTGGG
CCGGTGGCGT CGCCCGCGGC GCCAGGCCCG GATTCTCCAG GGCCGGCGAG TCCGGGCCCG
GCGAACGAGT CCTTGAGCGC CTCCGACACC TCGCGTGCGA CCCGTCCAAA AGCGGCGTCG
TAA
 
Protein sequence
MTADAVVIGA GVNGLVAANR LADAGWDVVV CEAADEPGGA CRSAEVTAPG FGTDLFSAFY 
PFAARSPALR ALDLTDHGLT WLHAPRVLAH PTPDGRCAVL STDLDGTAAS LAAYAPADAA
AWRAQVALWE RVRDPLLEAL LAAPFPPVRA GARLARALGA ADALRFARFA MLPVRRFAAE
EFAGAGAGLL VAGSALHTDL APESAGSALI GWLLAMLGQD VGFPVPRGGA GRLTAALVDR
LRSRGGVVRT RAEVDAVIVT AGRARGVRLT DGTAVRARRA VLADVDAVSL YRRLVGDEHL
PARLLTDLAR FQWDSSTFKV NWALAGPIPW SDRRIADAGT VHLGGTMDDL TMMSAQLACG
LVPADPFLVL GQMTTADPGR SPAGTQSAWA YFHLPQSPRG DAGGAGVTGR WDGDDTARLL
ERVERKLEAA APGFGSLILS RDTQSPRRLE DQDAVLRGGA LNGGTAALHQ QLIFRPVPGL
GRPETPIPGL YLASMSAHPG GGAHGGPGAM AATVALRDAG PAGPVRRRAS AAAHRLIYAG
PVASPAAPGP DSPGPASPGP ANESLSASDT SRATRPKAAS