Gene Franean1_5681 details

Gene Information

Locus tag: Franean1_5681
Symbol:
ID: 5674007
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 6896942
End bp: 6898384
Gene Length: 1443 bp
Protein Length: 480 aa
Translation table: 11
GC content: 75%
IMG OID: 641244534
Product: allantoinase
Protein accession: YP_001509937
Protein GI: 158317429
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0044] Dihydroorotase and related cyclic amidohydrolases
TIGRFAM ID: [TIGR00857] dihydroorotase, multifunctional complex type; [TIGR03178] allantoinase
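
The coordinate and length fields in the Gene Information table can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes that Start bp and End bp are 1-based inclusive coordinates and that the unspliced CDS ends in a single untranslated stop codon. The variable names are hypothetical and are not part of the database record.

```python
# Values copied from the Gene Information fields.
start_bp = 6896942
end_bp = 6898384
gene_length_bp = 1443
protein_length_aa = 480

# Assumption: coordinates are 1-based and inclusive on both ends.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon that is not translated.
codons = gene_length_bp // 3
expected_protein_aa = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", gene_length_bp % 3 == 0)
print("protein length consistent:", expected_protein_aa == protein_length_aa)
```

Under these assumptions, all three checks agree with the values reported in the record.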


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.642788
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGAGTGGTG GGCAGGCCGG TGCCGCGACG CCGCCGGACG CCGTCGAGGT GATCCGGTCG 
CGTCGGGTCG TACTGCCCGA CGGCGAGCGC CCCGCGGCCG TCCACGTCGT CGACGGCCGG
ATCACCGCGG TCACCGCGCC GGGCGAGGTG CCATCCGGCG TCGCCGTCAC CGATCTCGGC
GACCTCGTGC TCATGCCCGG CGTGGTCGAC ACGCATGTCC ACGTCAACGA GCCCGGCCGG
ACCGAGTGGG AGGGCTTCGC CAGCGCCACC CGGGCCGCGG CGGCCGGCGG CGTGACGACG
ATCATCGACA TGCCGCTGAA CTCGATCCCC CCCACCACCT CGCTGGACGC GCTGGCCGCC
AAGCGGGCGG CGGCCGAGGG CCAGGTCGCC GTCGACGTCG GTTTCTGGGG CGGGATCATC
GGCGCCGACG CCCGCCGGCT CGACGACCTA GCCGCGCTGC ACGACGCCGG CGTGTTCGGG
TTCAAGGCGT TCCTGGCACC CTCAGGGGTC GAGGAGTTCC CGCACGTGAG CCTCGACGTG
CTCGCCGCCG CCTCCCGGCA CACCGCCCGG ATGGACGCCC TCACCGTCGT CCACGCCGAG
TCGCCCTCCG TGCTCGCCGA GGCGCCTGAG GCGGCCGGCC GCACGTTCGC CAGCTGGCTG
CGCTCCCGCC CGCCGGCCGC CGAGAAGGCC GCGGTGGCCT CGCTCGCGGC GCTCACCGCC
TCGACGGGCG CGCGTTTGCA CGTCCTGCAC CTGGCGGCGG CCCAGGCGCT CGACGACGTC
GTCTCCGCCC GCGAGGCCGG CCTGCCCATG ACCGTCGAGA CCTGCCCGCA CTACCTGACC
TTCACCGCCG AGGAGGTCCC CGACGGCGCG ACCGTCTTCA AGTGCGCGCC GCCCATCCGG
GAACGCGCGA ACCTGGACCG GCTCTGGGAC GGCCTGGCCG CCGGCCTGTT CGCCGGCGTC
GTCACCGACC ACTCGCCAGC CACCCCGGCG CTGAAGTCCG TCGAGACCGG TGACTTCGGG
ACGGCCTGGG GTGGCATCGC CTCCGTCCAG CTGGGCCTGG CCGCGGTGTG GACCCAGGCA
CGCCGCCGCG GGCACGGCCT CGTCGACGTC GTCCGGTGGA TGTGCTCCGG CCCCGCCGAC
CTGGTCGGGC TGGGCGCCCC GGGACTGGCT ACCTCGGGAC TGGGCACCCC GGGACTGGGC
ACCGGCGGGG CGGGCTCCGT GCCGAACGGC ACCAAGGGAC GCATCGCCGT TGGCGCCGAC
GCCGACCTGG TGGTCTTCGA CCCCGACGCC ACGTTCGTCG TCGAACCGTC CCTGCTGCGC
CACCGCCATC CGCTCACGCC CTACGCTGGG CGAACGCTTG ACGGCGTGGT TCTGGCGACG
TACCTGCGGG GACGGCGGGC GGACGGTGAC CGGCCCGCGC GAGGCCGGCT GCTCTCCCGA
TGA
 
Protein sequence
MSGGQAGAAT PPDAVEVIRS RRVVLPDGER PAAVHVVDGR ITAVTAPGEV PSGVAVTDLG 
DLVLMPGVVD THVHVNEPGR TEWEGFASAT RAAAAGGVTT IIDMPLNSIP PTTSLDALAA
KRAAAEGQVA VDVGFWGGII GADARRLDDL AALHDAGVFG FKAFLAPSGV EEFPHVSLDV
LAAASRHTAR MDALTVVHAE SPSVLAEAPE AAGRTFASWL RSRPPAAEKA AVASLAALTA
STGARLHVLH LAAAQALDDV VSAREAGLPM TVETCPHYLT FTAEEVPDGA TVFKCAPPIR
ERANLDRLWD GLAAGLFAGV VTDHSPATPA LKSVETGDFG TAWGGIASVQ LGLAAVWTQA
RRRGHGLVDV VRWMCSGPAD LVGLGAPGLA TSGLGTPGLG TGGAGSVPNG TKGRIAVGAD
ADLVVFDPDA TFVVEPSLLR HRHPLTPYAG RTLDGVVLAT YLRGRRADGD RPARGRLLSR