Gene Francci3_2239 details


Gene Information

Locus tag: Francci3_2239
Symbol:
ID: 3905007
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 2610405
End bp: 2612531
Gene Length: 2127 bp
Protein Length: 708 aa
Translation table: 11
GC content: 67%
IMG OID: 637879570
Product: 5-oxoprolinase (ATP-hydrolyzing)
Protein accession: YP_481336
Protein GI: 86740936
COG category: [E] Amino acid transport and metabolism
              [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG0145] N-methylhydantoinase A/acetone carboxylase, beta subunit
TIGRFAM ID:
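
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch in Python, assuming that Start bp and End bp are 1-based inclusive positions and that the gene length includes a single terminal stop codon; the function names are hypothetical and are not part of the source database.

```python
# Values copied from the Gene Information table.
START_BP = 2610405
END_BP = 2612531
GENE_LENGTH_BP = 2127
PROTEIN_LENGTH_AA = 708


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length: int) -> int:
    """Codon count without the stop codon, assuming an unspliced CDS
    whose length includes exactly one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


if __name__ == "__main__":
    # 2612531 - 2610405 + 1 = 2127 bp
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    # 2127 / 3 = 709 codons, minus the stop codon = 708 aa
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions both checks print True.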


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0220335
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGGCAGG TAAGTGTTGA CATCGGTGGC ACGTTCACCG ACTGCTTTCT TGTTTACGAC 
GACACCTACA TCGAGGCGAA GTCGCTGACC ACGCACCACA ACCTCGCCTC CGGCTTCATG
GATGCCCTGG CAAAGGCCAG TGCCCAGGCG GACCTGGATG TGCAGACGGT GCTCTCGACC
GTCGACGCGG TGCGTTACGC CACGACCCTC GGGACGAACG CGCTCATCGA ACGCAGCGGT
CCGGCGGTCG GCATCCTCAC CACCGCCGGT TTCGAATCGA CCGTGCCGTT GATGCGGGCC
CGTGGCTACG GCGACGGGCT GACCGAGGCC CAGCAGTCCG ACCTGCCCGC CGCGGACCGG
CCGGTGTCGC TGGTGCCGGT GACCCGGATC GCCGGCGTGC AGGAACGCGT CGACTACGGC
GGCACGGCGG TGCTGTCCAT CGACGAGGAC GACGTGCGCC GCCAGGTGCG CAAGCTGGTG
GACGAGGGCG CGCAGGCGTT CGTGGTCGCG CTGGTCAACG CGGTGATCAA CCCGGCCCAC
GAGAAGGAAG TCGAGCGGAT CATCCTCGAC GAGTACCCCA CGCACATGCT GGGGGCGATC
CCGATCGTGC TGTCGCACCG GGTGACCGGC CGCAAGGGTG AGTACGCCCG GACCATGTCG
GCGGTCATCG ACGCCTACCT GCACAGCCAG ATGTACCACG GGCTGGCGTC GCTGGAGATC
GAGCTGCGCC GCAACGGCTA CACCAAGCCG ATGCTGCTGG TGCACAACAC CAGCGGCATG
GCCCAGCTGA ACTCGACCTC CTCGCTGCAG ACCATCCACT CCGGGCCGGT CGCCGGGCTG
GAGGCCACCA ACTACCTGTC GAAGACCTAC CACCAGCCCA ACATCATCGC CACCGACATG
GGCGGCACCA GCTTCGACAT CGGCCTGGTC ACCTCCGACG GGGTGAAGTT CTACGACTTC
AACCCGGTCA TCGACCGCTG GCTCGTCTCC ACCCCGATGA CCTACCTGCA CACCCTCGGC
GCCGGCGGTG GCTCCATCGC CCGCTACGAC CGCCTGTGGG AGGCCATCGA GGTCGGCCCG
GAAAGCGCCG GTTCCGACCC GGGCCCGGCC TGCTACGGCC GGGGCGGGCG GTTCCCGACC
ACCACGGACG CGAACCTGGT GCTGGGCTAC CTCGACGCGG ACAACTACGC CGGTGGCACG
ATCAAGCTCA GCCTGCGCCG GGCGCAGCGG GCCATCGAGG AGCACATCTG CAAGCCCACC
GGACTGAGCC TGATCGAGGC GGCGAAGGCG ATCAAGCGCA AGGTGGACAG CAACATGGCC
AACGCGATCT TCAAGGAGGT CGCGGTCAAG GGCTATAACC CGAAGAACTT CGTCGCGCTT
AGCTACGGCG GCGGTGGTCC GCTGCACGCC TGCGGATACG CCAACACGCT GGGCATCCAG
AATGTGCTGA TCCCGCCGTT CAGCTCGGTG TTCTCCGCGC TGGGCGCCGG CAATATGAAC
CAGCTGCACA TTCATGAGTA CTCGCTGTAC CTGATGGTCT ACGACGCCAC CACCCGCAGG
ATGTTGGACG ACTTCCAGGT CTTCAACGAC ACGGTGACGG AACTTGCCGC GAAGGGCCGC
GACGACCTCC TCCGCCAGGG CGCCGACCCC GCCAACATCC GCTCGCGGGT GGAGCTGGAC
ATGCGCTACG GCAACCAGCT CGCCCAGATC GGGGTGGTGT CGCCGTTCGA GCGGCTCACC
TCACACCGCG ACGTGATCGA GCTGCTGGAC CTGTTCAGCA GCATCTACGC CAAGCGGTAC
GGCGAAGGCA GCCAGGCGCC CGAGGCCGGT GTGCGCATCA ACGTCATCCG CGTGGTCAGC
TACGTCGAAC GCGACAAGTT CGACCTGCAG CCCACCCAGG CCGAGCCGCG CCCCGCGTCG
AACCCCGCCC GGTGGCGGGA ATGCCACTAC CCGGGCATCG ACGGCGCGGT CAAGACTGCC
GTCTACGACT TCGCCGACCT CGAAGAAGGC CACGTCATCG AGGGCCCCGC CCTGATCGAA
ACCCCGAGCA CGACCTACCT CGCCGAACCC GGATGGCAGC TGACCATCGG CCGTACCGGC
TCCGCGGTGC TCGTTCGGAC CATCTGA
 
Protein sequence
MRQVSVDIGG TFTDCFLVYD DTYIEAKSLT THHNLASGFM DALAKASAQA DLDVQTVLST 
VDAVRYATTL GTNALIERSG PAVGILTTAG FESTVPLMRA RGYGDGLTEA QQSDLPAADR
PVSLVPVTRI AGVQERVDYG GTAVLSIDED DVRRQVRKLV DEGAQAFVVA LVNAVINPAH
EKEVERIILD EYPTHMLGAI PIVLSHRVTG RKGEYARTMS AVIDAYLHSQ MYHGLASLEI
ELRRNGYTKP MLLVHNTSGM AQLNSTSSLQ TIHSGPVAGL EATNYLSKTY HQPNIIATDM
GGTSFDIGLV TSDGVKFYDF NPVIDRWLVS TPMTYLHTLG AGGGSIARYD RLWEAIEVGP
ESAGSDPGPA CYGRGGRFPT TTDANLVLGY LDADNYAGGT IKLSLRRAQR AIEEHICKPT
GLSLIEAAKA IKRKVDSNMA NAIFKEVAVK GYNPKNFVAL SYGGGGPLHA CGYANTLGIQ
NVLIPPFSSV FSALGAGNMN QLHIHEYSLY LMVYDATTRR MLDDFQVFND TVTELAAKGR
DDLLRQGADP ANIRSRVELD MRYGNQLAQI GVVSPFERLT SHRDVIELLD LFSSIYAKRY
GEGSQAPEAG VRINVIRVVS YVERDKFDLQ PTQAEPRPAS NPARWRECHY PGIDGAVKTA
VYDFADLEEG HVIEGPALIE TPSTTYLAEP GWQLTIGRTG SAVLVRTI