Gene Franean1_4256 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4256 
Symbol 
ID5672611 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5076867 
End bp5078087 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content74% 
IMG OID641243129 
Product2OG-Fe(II) oxygenase 
Protein accessionYP_001508546 
Protein GI158316038 
COG category[R] General function prediction only 
COG ID[COG3491] Isopenicillin N synthase and related dioxygenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.798604 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.385246 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCGACT CCCCCGCTCG TCCGCGCGCC GCGAGCGCGG GTGCCGCGCG GCTCCCCGTC 
CTCGACCTGC GCGACTACAC CGCGCCGGAC GGCCGTGCCG ACCGCCGGGA CTTCCTCGAC
GCGCTGCGCG CGGCCTGCCG CGATCCAGGC TTTCTCCAGC TGACCGGGCA CGGCGTACCT
TCCGGCCTCA CGGAGCGGAT CATCGCGGTG AGCCGCGCGT TCTTCGACCT GCCGCTCGCC
GCCAAACTCG AGATCGAGAA CGTCCACTCC CCGCACTTCC GCGGCTACAC CTGCCTCGGG
CACGAGATCA CCCGGGGCAG GCCGGACCTC CGCGAGCAGA TCGATATCTC CGACGAGCGG
CCGGCCCGCG TCCTCGGGCC GGACGACCCG CCCTACCTGC GTCTGGACGG GCCCAACCAG
TGGCCGGCGG CGCTGCCGGA GCTGCGCGTG GCCGCCCTCG TCTACCTGGC CGAGCTTGGG
CGCGTCGCGC GGGTGCTGGT GCGCGCGCTC GCCGAATCGC TGGGGCTGCC GCCCGACCAC
CTGGACCCGA CCTTCTCCGC CGAGCCCCGC TCCCACCTGA AGCTGCTGCG CTACCTGCCG
ACCCCTGCCG GCAGCGCCAC CGGACACGAC GCTGCCGGAC ACGACGCCGC CGAGCCCAAC
GCCGCCGCGG ACGGCCAGAA CGTCGACCAA GGCATCGGCC AGGGCGTCGG CGCGCACAAG
GACGGCGGCT TCCTGACCTT CGTCCTGCAG GACGGCGTCC GCTCAGCCCG CCAGGACGGC
GCCCACCCGG CCCCCCGCTC CCCCACGGGC CCGTCCGGCC TGCAGGTCGC CGACGGCGCG
GGCGGGTGGA TCGAGGCCGC CGCAGTGCCT GGCGCGTTCG TGGTGAACAT CGGCGAGATG
TTCGAGCTGG CCACCCGGCG TTACTACCGG GCAACCGTCC ACCGGGTCGT GAGCCCACCA
CCAGGCCACG AGCGGGTGTC CGTGGCGTTC TTCTTCGGGC CACGGCTGTC GGCCACCCTC
GAACCCATGC CGCTGCCCGA TGCCCTCCTC GCGGAGATCC CCGACGCCGA ACCACCCGAC
CCGGAGAACC CGATCTTCGC CCAGCACGGG ACGAACACCC TGAAGAGCTG GCTGCGCAGC
CATCCCGAGG TGGCCCGCCG CCACTACGCC GACGTGGCAC CCCCGGCGGG CGTGGCGCCG
ACGGCCGGAG GCGGCGCGTG A
 
Protein sequence
MPDSPARPRA ASAGAARLPV LDLRDYTAPD GRADRRDFLD ALRAACRDPG FLQLTGHGVP 
SGLTERIIAV SRAFFDLPLA AKLEIENVHS PHFRGYTCLG HEITRGRPDL REQIDISDER
PARVLGPDDP PYLRLDGPNQ WPAALPELRV AALVYLAELG RVARVLVRAL AESLGLPPDH
LDPTFSAEPR SHLKLLRYLP TPAGSATGHD AAGHDAAEPN AAADGQNVDQ GIGQGVGAHK
DGGFLTFVLQ DGVRSARQDG AHPAPRSPTG PSGLQVADGA GGWIEAAAVP GAFVVNIGEM
FELATRRYYR ATVHRVVSPP PGHERVSVAF FFGPRLSATL EPMPLPDALL AEIPDAEPPD
PENPIFAQHG TNTLKSWLRS HPEVARRHYA DVAPPAGVAP TAGGGA