Gene Franean1_4825 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4825 
Symbol 
ID5673166 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5762339 
End bp5764099 
Gene Length1761 bp 
Protein Length586 aa 
Translation table11 
GC content73% 
IMG OID641243681 
Product5-oxoprolinase (ATP-hydrolyzing) 
Protein accessionYP_001509097 
Protein GI158316589 
COG category[E] Amino acid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0146] N-methylhydantoinase B/acetone carboxylase, alpha subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.308896 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0497189 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGGCG CGGACCCGGT GCTGGCCGGG ATCGTCGACA ACCGGCTGCA CGCGATCGCC 
GAGCAGATGG CGCAGGCCAT GCTGCGTTCC TGCCGGTCGA TGGTGTTCCA GAGCCGGGAC
TTCGTCACCG GGATCTTCAC CGCCCGAGGC GAGTGGGTCG CGACCAAGGA CTACATCCCC
GTCCTGGCCG GTTCGCTGCC CAGCGCCCTC GCCGGCGTGA CCGAGCGCTT CGCCGCCGAG
TTCGCGGCCG GGGACCTGTA CGAGGGCGAC GTGTTCGTCC TCAACGATCC GTACCACGGC
AACAACCACC CGCCGGATAT CACGATCATG AAGCCGGTGT TCCACCGGGG CGTGCACCGA
TTCTGGACCG TGACGAAGGG GCATCACGCC GACACCGGCG GCGCCGTCGT CGGCTACAAC
CCGTACGCCC GGGACTGCTG GGAGGACGCA CTGCGCATCC CGCCCGTCCG GCTGCACCAC
CGCGGGCGGC GCCAGCGGGA CGTCTGGGAT CTCATCCTGC TGAACCTACG GCTGCCGGAC
GTCGTCGAGG CCGACCTGGA CTGCCAGATC GGCGCCGCCA CGCTCGGCGA GCGCGAGCTG
GTCGCGCTGC TGGACGCCTA CGGCGTCGAA CCGGTCGAGA ACGCCGTCGC CTACCAGCTC
GACGCGTCCC GCCGGCACAT GCGAGCCGAG ATCGGACGGC TGCCGGACGG CGTCTACCGG
TCGGTGCGCC ACCTCGACGA CGGGGGAGCA CACCACCGCG AGCCGATCGC GGTGCGCCTG
GAGCTGACCA TCCGCGGCGA GTCGGCGCTG TTCGACTTCA CCGCCTCCGA CCCGCAGGTC
GTCGGCTACG CGAACTCCAC CCCGGCGAAC ACCGCCGCCG CCGTCCTCAT CGGCCTGCTC
GGGGTCACCG ACCCCACCCA ACGCCGCACC AGCGGCGCGC TGGCGGCCCT CGACATCCGC
ACCACGCCGG GCACGATCGT GCACGCCGTC GAGCCGGCCG CGACGTCGCT GTGCACGCTG
ACCGCCTGCG AGACCATCGT CGAGACGGTC TGGGAGGCCG TGGGGCAGGC CGACCCGACG
GTGACGAACG CCGCCTGGTG CCGTGGCTAC ACGTTCGCGG CCATGGGGGT CGACGCGCGC
ACCGGCCGGC CGTTCGCCGT GACCGGCGCG ATGAAGGGCG GCTCGGGCGC CACCGCCGGT
TTCGACGGCT GGGACGCCGT GGGGCCGCCG GTCGCGATGG GCGGCGGACG GGAGATCGAC
GTCGAGCTGC ACGAGCTGAC CGCGCCGACC ACCGTGCTCA GCCAGGAGTA CGAGACCGAC
TCCGCCGGCG CCGGCCGGTG GCGCGGCGGC CTCGGCGCGG TGTCGCGATG GCGTATTGAC
CAGGACGACC TGTCGGTGCT GTGCATCGGA TCAGGTGCCC TCGACCTGAC AGCGCCGTTC
GGCGCGGCGG GCGGGGCGCC CGCCCCCGCC AACCGCGGCA TCGTCCACCA CGCCGACGGG
CGGATCAGCC GCCCCCCGAC GAACTCCCTG CTACGGCTTA ACCGCGGGGA CGTGGTCGAG
ATCCACACCT CCGGCGGAGG CGGTTTCGGC GACCCCCTCG ACCGTCCCGT CGACGCGGTC
GCCGACGACG TCCGCGACGG CCTCGTCTCG CCGACGGCCG CCCGCGATGT CTACGGCGTC
ATCCTCGACC CGATCACCGA TGAGGTGGAC GCCGCCGCGA CCGCCGCCCG CCGCGCGGCA
GCCCGCTCCC GCTCCCGCTG A
 
Protein sequence
MTGADPVLAG IVDNRLHAIA EQMAQAMLRS CRSMVFQSRD FVTGIFTARG EWVATKDYIP 
VLAGSLPSAL AGVTERFAAE FAAGDLYEGD VFVLNDPYHG NNHPPDITIM KPVFHRGVHR
FWTVTKGHHA DTGGAVVGYN PYARDCWEDA LRIPPVRLHH RGRRQRDVWD LILLNLRLPD
VVEADLDCQI GAATLGEREL VALLDAYGVE PVENAVAYQL DASRRHMRAE IGRLPDGVYR
SVRHLDDGGA HHREPIAVRL ELTIRGESAL FDFTASDPQV VGYANSTPAN TAAAVLIGLL
GVTDPTQRRT SGALAALDIR TTPGTIVHAV EPAATSLCTL TACETIVETV WEAVGQADPT
VTNAAWCRGY TFAAMGVDAR TGRPFAVTGA MKGGSGATAG FDGWDAVGPP VAMGGGREID
VELHELTAPT TVLSQEYETD SAGAGRWRGG LGAVSRWRID QDDLSVLCIG SGALDLTAPF
GAAGGAPAPA NRGIVHHADG RISRPPTNSL LRLNRGDVVE IHTSGGGGFG DPLDRPVDAV
ADDVRDGLVS PTAARDVYGV ILDPITDEVD AAATAARRAA ARSRSR