Gene Franean1_6836 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_6836 
Symbol 
ID5675149 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp8332066 
End bp8333265 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content70% 
IMG OID641245685 
Productamidohydrolase 2 
Protein accessionYP_001511076 
Protein GI158318568 
COG category[R] General function prediction only 
COG ID[COG2159] Predicted metal-dependent hydrolase of the TIM-barrel fold 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0916141 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGACTACC AGCTGATCGA TGCCGATGGC CACTACTACG AGCCCGACGA CTGCTTCTCC 
CGGCATATCG AGGCCGGTTT CAAGGAGCAC ACCGTACGGG TCGAGCGCGG GGCCGACGGC
CTGGGCCGGG TCTATCTGGG CGACCGCCGC ACATTCATGA GCGTGATGCC CGGGGACTAC
GCGTCCGCCC CCGGCGCGCT GCAGGGGCTG TTCGTCGGCG AGGTGGCGGA CGGCTTCACC
CACCGCGAGG TCCTGAACGC GAAGGACCAC CCCGCGTTCA TCGAACGGCC CGCGCGGCTG
GACCTGATGG ACGACCAGGG TGTCGAGGCG ACCATCATGC TGCCCACCCT CGGCGTGGCC
GTCGAGCAGG ACATGGCGGA CGACGTCGAG CTGACCTACG CCAGCCTGCG CGCGTTCAAC
CGGTGGCTCG AGGAGGACTG GGGCTACGCC GAGCAGGACC GGATCTTCGC CGTTCCGATG
CTCTCCCTGC TCGACATCGA CCACGCCGTG GCCGAGCTGA AGCGGGTGCT GGACGCCGAC
GCGCGCCTGG TGCACCTGCG CCCCGGGCCG ATCGGCGGCC GCTCCCCCGC GCACCCCGAC
TTCGACCGGT TCTGGGCGAT GGCGGCCGAG GCCGGGGTCG GGGTCGTGTT CCACGTCTCC
AACAGCGGTT ACAACGCGGC GTACGGCCAG CTCTGGTCCG AGGACGCGGG CAACCCGTCG
CACCTGCAGT CGCCGCTGCA GTGGGCGCTG TGCAACACCG AGCGGCCGAT CGTCGACACG
CTCAGCGCGC TCGCCCTGCA CAACCTCTTC GGCCGACACC CCAACATCAA GATCATCTCG
ATCGAGAACG GCAGCAACTG GGTGCGACCG CTGCTGAAGA CGGTCGACAA GGCCGCCGCG
CTCGGCCGGC GCGGCCCGAT GATCGGCGGC ACGCTCTCGG CGAAGCCCAG CGAGATGCTC
GCCGAGCACC TGTGGGTCTG CCCGTTCCCC GAGGACGACG TGCACGACCT CATCAGCGTG
CTCGGCCCGG ACCAGGTCCT CTTCGGTTCG GACTACCCGC ACCCCGAGGG GCTCCGCCAG
CCCATGGACT ACGTCGAGCG CCTCGACGAC TGCGACCCGG TCACGCGGCG CAAGGTGCTG
CGCAGCAACA CCGCCGACCT GCTCCGGATC CCCGACAAGG AGACCGCCAA GTCCGCGTAG
 
Protein sequence
MDYQLIDADG HYYEPDDCFS RHIEAGFKEH TVRVERGADG LGRVYLGDRR TFMSVMPGDY 
ASAPGALQGL FVGEVADGFT HREVLNAKDH PAFIERPARL DLMDDQGVEA TIMLPTLGVA
VEQDMADDVE LTYASLRAFN RWLEEDWGYA EQDRIFAVPM LSLLDIDHAV AELKRVLDAD
ARLVHLRPGP IGGRSPAHPD FDRFWAMAAE AGVGVVFHVS NSGYNAAYGQ LWSEDAGNPS
HLQSPLQWAL CNTERPIVDT LSALALHNLF GRHPNIKIIS IENGSNWVRP LLKTVDKAAA
LGRRGPMIGG TLSAKPSEML AEHLWVCPFP EDDVHDLISV LGPDQVLFGS DYPHPEGLRQ
PMDYVERLDD CDPVTRRKVL RSNTADLLRI PDKETAKSA