Gene Franean1_4826 details


Gene Information

Locus tag: Franean1_4826
Symbol:
ID: 5673167
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 5764096
End bp: 5766591
Gene Length: 2496 bp
Protein Length: 831 aa
Translation table: 11
GC content: 76%
IMG OID: 641243682
Product: 5-oxoprolinase (ATP-hydrolyzing)
Protein accession: YP_001509098
Protein GI: 158316590
COG category: [E] Amino acid transport and metabolism
              [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG0145] N-methylhydantoinase A/acetone carboxylase, beta subunit
TIGRFAM ID:
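
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(5764096, 5766591)
print(length)                  # 2496
print(protein_length(length))  # 831
```

Both results agree with the Gene Length (2496 bp) and Protein Length (831 aa) fields in this record.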


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.112308
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACCTCCG CCCCGCGGAC TCCGCGGATC CCGACTGCGC GGGCCCTGAA GGCCCCGCGG 
GGCGGCGCGG TCGGGGGATA CCGCGTCGCG GTGGACGTCG GCGGGACCTT CACCGACCTC
GTCCTGCGCC GCCCGGACGG GACGGAGGCC GTCCACAAGA CGAGCTCGAC GCCGGGGGAC
CCGGCTCGGG CCGTCGTCGC CGGGCTCACC GGGCTGGCCG CCGCCGAGAG GCTGGCGCCG
CACGAACTGC TCGGCCGCAC CGAAACCATC GTCCACGGCA CGACGATCAC CACCAACGCC
CTGATCACCG GATCGGGCGC GTGCACGGGG CTGGTCACGA CCGCCGGTCT GCGCGATGTG
CTCCCGGCCC GGCAGGGCAG CCGCGAGGAC CAGTTCGCCT CGAAGTCCGC CCCACCGCCG
TCTCTGGTGC CCCGCCGTCT CGTCATCCCC GTCCGCGAGC GGGTGGACCG GACGGGCGCC
GTGGTGATGC CACTGGACGA GTCCGACGTG CGCCGCGCGG CCGGTCGGCT GCGCGCCGCC
GGGGTGGAGG CCGTGGCGGT GAGCTTCGTC TTCTCCTATC TGTTCGACGG GCATGAGCGG
CAGGCCGCGG CGATCCTCGC GCAGGAGCTG CCGGGGGTGT TCGTCACCAC GGCATCCCGG
CTCGTCCCGC AGGTGCGGAT GTTCGAGCGG ACCACGACCA CCGCGCTCAA CGCCTACGTG
GGTCCTGTCC TGCGCGACTA CCTCGACGGG CTCGCCGACC GGCTGGCGGA GCTCGGATAC
GGCGGCCGGG TCCTGATCAT GCAGTCGAAC GGTGGTGTCG TCGGGCTGGA GCAGGCCGCC
GACCGTGCCG TCGAGACACT GCTCTCCGGG CCGGCGGGAG CGCCGGCGGC GGTCGCCGAG
ATCGTGCGTA CCCTCGGCAC GCGGGCGGCA TGCGCCGGTG CGGGCGGCAC CGGCGGCGCG
GGCGGCACCG GCGGCGCGGC CGGCGTCGAC GGCGCGGGCC CGCCGACGCG ACCGCCCGAC
ACCATCACGA TCGACATGGG CGGGACGAGC TTCGAGGCCA GCCTGGCCCG CGGCGGGCGC
ACCGAGACCA CCAGCAACGG CTCGCTGGCC GGGCATCCCG TCTGCCTGCC CGCCCTGGAC
ATCAGCACCA TCGGCGCGGG CGGCGGGTCG GTGGCCTGGG TGACCGCGGG CGGGCTGCCG
AGGGTCGGCC CGCGCAGCGC CGGGGCGGTG CCGGGGCCCG CGGGCTACGG CCGTGGCGGC
ACCGAGCCGA CCGTGACCGA CGCCGACCTG CTCCTCGGCT ACCTCGACCC GGACGGCTTC
AGCGGCGGGA TCGCGCTCTC GCGGGACGCG GCTGTCCGGG CTGTCGGTGG GCTCGCCGAG
CGACTCGGGA TCAGGGGCTC CGCCGGCCGG CATCCTGACA CCGCGGTGAC TGACGCGGCA
GTGATCGACA CGGGCGTGAT CGACGATGCG GTGATCGAGG CGGCGGCCGG TGTGCACCGC
GCGGTGAACG CGGCGATGGC CGACAGCCTG CGCCTGCTGA CCGGGCGCCG GGGCCTCGAC
CCGCGGCGGT TCGCCCTGGT GGCCGCCGGC GGCGCCGGGC CGGTGCACGC CGTCGAGATC
GCCCGTGAGC TCGGCATCCC GCTGGTGGTC GTCCCGGCCG GTGCCTCCGT CCTGTGTGCC
CGCGGGATGC TGCGGACAGA CCTGGTGCGC TACTACGTGA CGACCCTTCG CCCCGACCCG
CGCGACCCGG GCGCCCTGCC GGTCCCGTCG GACCTGGACG TGGCCTTCGA CCGGATGGCC
GCCGAGGCGC TGGACGACCT GCGTCCCCAC CTGCCCGGGA CGGGGCGGGT CGGGTTCACC
CGGAGCGCGG ATGTCCGCTA CGCCGGTCAG GTCCACGAGA TCCCCGTTCC CTTCGCGGAC
CCTGCTGTGC CCATGGACCC TGCCGTGCCC GCGGAGCCGG CCGGATCGGT GCTCCCGCGC
ACCGATGACG CCGGTCTGAA GGATCTCCTT GAGCGGTTCC ACGACGAGCA CGAGCTGCGC
TACGGGTACC GGCGCCCGGA ACTCGCGGTG GAGATCGTCA ACCTTCGCCT GGTGGCCCGC
GTCCCGACCC TCCCGCCCGT CCCGGCTGAT TCGCCCGCCC CACCTGACCC AGCTCATCCA
GCCGCCGTGG CCCCGCCGAT CCGTGCGGCG GCACTCACCG CGCCCCTCGC GCCCCTCGCG
GTGGCCCGCC GCCGCGAGGT GTGGTTCGAC GGCGGCTTCC GCGAGGTGAA CGTCCACGAC
CCGGCGACGC TGAGCCCCGG CTGCCCCCAC CAGGGACCGG CGTTGGTGGC GCATCCCGCG
AGCACCGTGG TCGTCCCGCC CGGATATGCC TTCGAGCTGG CCGAGGACGG AACCGTCCTG
GTCTTCGCGG CCACTGACAC CGCGGACGCC GTCCTGCACC GGCTCGGGCT CGTCGGCGTC
GCCGGCGCCG GCGACGCCGA CGCGGCGGCC CGATGA
 
Protein sequence
MTSAPRTPRI PTARALKAPR GGAVGGYRVA VDVGGTFTDL VLRRPDGTEA VHKTSSTPGD 
PARAVVAGLT GLAAAERLAP HELLGRTETI VHGTTITTNA LITGSGACTG LVTTAGLRDV
LPARQGSRED QFASKSAPPP SLVPRRLVIP VRERVDRTGA VVMPLDESDV RRAAGRLRAA
GVEAVAVSFV FSYLFDGHER QAAAILAQEL PGVFVTTASR LVPQVRMFER TTTTALNAYV
GPVLRDYLDG LADRLAELGY GGRVLIMQSN GGVVGLEQAA DRAVETLLSG PAGAPAAVAE
IVRTLGTRAA CAGAGGTGGA GGTGGAAGVD GAGPPTRPPD TITIDMGGTS FEASLARGGR
TETTSNGSLA GHPVCLPALD ISTIGAGGGS VAWVTAGGLP RVGPRSAGAV PGPAGYGRGG
TEPTVTDADL LLGYLDPDGF SGGIALSRDA AVRAVGGLAE RLGIRGSAGR HPDTAVTDAA
VIDTGVIDDA VIEAAAGVHR AVNAAMADSL RLLTGRRGLD PRRFALVAAG GAGPVHAVEI
ARELGIPLVV VPAGASVLCA RGMLRTDLVR YYVTTLRPDP RDPGALPVPS DLDVAFDRMA
AEALDDLRPH LPGTGRVGFT RSADVRYAGQ VHEIPVPFAD PAVPMDPAVP AEPAGSVLPR
TDDAGLKDLL ERFHDEHELR YGYRRPELAV EIVNLRLVAR VPTLPPVPAD SPAPPDPAHP
AAVAPPIRAA ALTAPLAPLA VARRREVWFD GGFREVNVHD PATLSPGCPH QGPALVAHPA
STVVVPPGYA FELAEDGTVL VFAATDTADA VLHRLGLVGV AGAGDADAAA R
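
The sequences above are laid out in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup together with a GC-fraction calculation; the helper names are hypothetical, and the example input is only the first two blocks and the last block of the gene sequence, not the full record.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two 10-base blocks of the gene sequence shown above.
head = clean_sequence("GTGACCTCCG CCCCGCGGAC")
print(len(head))           # 20
print(head.startswith("GTG"))  # True

# Last block of the gene sequence shown above.
tail = clean_sequence("CGATGA")
print(tail.endswith("TGA"))    # True
```

The gene begins with GTG, which is an alternative start codon under translation table 11 and is read as methionine at the initiation position, consistent with the protein sequence starting with M. It ends with the stop codon TGA. Applying `gc_fraction` to the full cleaned gene sequence would give a value to compare against the GC content field.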