Gene Franean1_4667 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4667 
Symbol 
ID5673009 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5571280 
End bp5572521 
Gene Length1242 bp 
Protein Length413 aa 
Translation table11 
GC content71% 
IMG OID641243524 
Productphytanoyl-CoA dioxygenase 
Protein accessionYP_001508940 
Protein GI158316432 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.180988 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.254841 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCATCG CTCTGGATCG CCGGACACGT CGCGATGCCG ATCTGCGCCA CGTCGACATG 
GAGGGCTTTC TCGCGCGGGA GTTCCCGGGA CTCGTCGCGC GCCACGGCCC TCTCGTCGCG
CGGGGGATCG CCTTCTTACA GGCACCGCCG CTCTCCATCG AGGTCGGTGC GTCCTGCTGG
TCGTTCGTGA GCGACGGCAC CACCCTGACC GCGTCCCGCG GCACCGTCGA GGGCGCGCTC
GTCGTGACGC TCAGCGAGGC CGAGTGCTCC GACTGGGCGC AGAACCAGCG ATCTTTCAAC
GCGCTGCTGA TCGCGCGCGA GCTGCGCTAC CGCGACGGGT CCGAGGTGGA CGTCTCCGCG
TGGGACTCGC TCTGGCTGAC GCTGCTGGAG GGCTGGCCGG TCGTGGACGA CACGATCGAG
TTCGTCGATC GTCACGGAGC CCCGGTCGGC CTCGGGCGCG TCTTCACCCC GGCCGACGAT
CCGAGGGACG TCGCGCATTT CCTGCGCGAG GCCGGCTACC TCCACCTGCG CGGCTGGCTG
GATCCGGCGG ACATGACCGA GGTCTCCTCC GATATCGACC GGGCGCTCCC CGACTACCGC
GAGGGTGACG GCCGTTCCTG GTGGGCGACC GTCGAGGACG GCAGCCGCCG CTGTGTGCGG
CTGCAGGAGT TCATCGCCCG CTCCCCCGCG ACCTCGGCGA TCCTGCGCGG CGAACGCTGG
GACCGGCTAC GCGGTGTGCT GGCCGGCGGC GACCCGCTCG AACGTCCCGC GCTGGACGGC
CGTGGTCCGG AAGCGCTCGT CAAGCCGGTC GGCGTCACGG TCGGCGCGTC CGACGTGAGC
TTCCACCGCG ACTGCCACTT CGGCCGCCAC GCCTACAACT GCTCGAACCT GGTCGTCGGG
ATCGCGGTCA CCGGCAGCGG CGAGACCAAC GGCCAACTCC GCGTCATCGC CGGATCCCAC
CGCGTGCTGA TGCCGGTGGA GATCGCCAAG TCGCGACCGT ACCTGCCTGT CGTCGCCGTG
CCGACCGAGC CGGGCGACGT CACGGTGCAC CTGACCTGCA CCCTGCACGA GTCGACTCCG
CCCCTCGTCG AGGAGAGGCG CGTCCTCTAC ACCGGGTTCT CCCTCGCTCC GCGCACGGAC
GACGTTAACG GCGACACCAC CAGCGACAGC GGCGGCCACG CCCTGGCGGC GCTGCGCGAG
CGGATCTCCC AGATCCTCCT CGACGAGGCG ACCCGAACGT GA
 
Protein sequence
MPIALDRRTR RDADLRHVDM EGFLAREFPG LVARHGPLVA RGIAFLQAPP LSIEVGASCW 
SFVSDGTTLT ASRGTVEGAL VVTLSEAECS DWAQNQRSFN ALLIARELRY RDGSEVDVSA
WDSLWLTLLE GWPVVDDTIE FVDRHGAPVG LGRVFTPADD PRDVAHFLRE AGYLHLRGWL
DPADMTEVSS DIDRALPDYR EGDGRSWWAT VEDGSRRCVR LQEFIARSPA TSAILRGERW
DRLRGVLAGG DPLERPALDG RGPEALVKPV GVTVGASDVS FHRDCHFGRH AYNCSNLVVG
IAVTGSGETN GQLRVIAGSH RVLMPVEIAK SRPYLPVVAV PTEPGDVTVH LTCTLHESTP
PLVEERRVLY TGFSLAPRTD DVNGDTTSDS GGHALAALRE RISQILLDEA TRT