Gene Franean1_3368 details

Gene Information

Locus tag: Franean1_3368
Symbol:
ID: 5671739
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 3993791
End bp: 3995035
Gene Length: 1245 bp
Protein Length: 414 aa
Translation table: 11
GC content: 71%
IMG OID: 641242256
Product: amidohydrolase
Protein accession: YP_001507676
Protein GI: 158315168
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG1228] Imidazolonepropionase and related amidohydrolases
TIGRFAM ID:
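
The coordinate and length fields in this record are mutually consistent, which a short check can confirm. The sketch below assumes that Start bp and End bp are 1-based and inclusive (the record does not state the convention) and that the CDS ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


gene_len = span_length(3993791, 3995035)
print(gene_len)                  # 1245
print(protein_length(gene_len))  # 414
```

Both values agree with the Gene Length and Protein Length fields of the record.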


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.929128
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACCGCGC CGGTCACGAT CCGGGCCGCC CGCTGGCTGG ACGTCGTCGC CGGCGAGGTC 
CGCTCGCCCG CGGTGATCGT GGTGGAGGGG AACCGCATCC TGGCGGTCAA CCCCGCCGAG
ACCCCCGCCG GCGCGGTCGA GATCGAGCTG GGTGACGTGA CGCTGCTACC CGGCCTGATG
GACATGGAGC TCAACCTCCT CATCGGCGGG CCCGACACCC CGACGGGCCT GCCGCTGCCG
ATGCACGGTG TTCAGGATGA CCCGGCGTAC CGCACCATCC GCGGGACCGT CAACGCCCGC
GCCACGCTGC TCGCCGGTTT CACCACCGTC CGCAACCTCG GCCTGATGGT CAAGAGCGGG
GGCTACCTGC TCGATGTCGC GGTGCAACGT GCCGTCGAGC AGGGATGGGT GGAAGGGCCG
CAGATCATCC CGGCGGGACA CGCGATCACC CCGTACGGCG GCCACCTCGA CCCGACGGTG
TTCCAGCGCC TGGCACCCGG AATCATGCCG CTGAGCATCG GTGAGGGCAT CGCCAACGGC
GTGGGCGAGG TACGGGCCTG TGTCCGCTAC CAGATCCGGC ACGGCGCCAA GGTGATCAAG
GTGTCGGCCT CCGGCGGGGT GATGTCGCAC AGCACCGGCC CGGGCGCCCA GCAGTACTCC
GACGAGGAGC TCGCGGCGAT CGCCGACGAG GCGCACCGGG CGGACATCCG CGTCGCCGCA
CACGCGGTGG GCGACCGGGC GGTGCAGGCC TGTGTCCGTG CCGGTATCGA CTGCATCGAG
CATGGTTTCC TCGCCAGCGA CGAAACACTG CGGATGATGG CCGACCACGG CACGTTCCTG
GTGTCCACGA CCTATCTGAC CGATGCCATG GACATCGCGC GGGCAGCACC GGAGCTCCAG
CGGAAGGCGG CTGACGTCTT CCCCCGGGCG AAGGCGATGC TGCCCAGGGC CATCGCCGCC
GGGGTGAAAA TAGCCTGCGG CACCGACGCC CCGGCCGTTC CCCATGGCGA CAACGCCAAG
GAGCTGGCCG CGTTGGTCTC GCGGGGCATG ACCCCGGTGC AGGCCCTGCG GGCCGCGACC
GTCACCAGCG CGGAGCTGGT CGAGCTCGAC CACGAGCTGG GCCAGCTCAG GGACGGCTAC
CTCGCCGACA TCATCGCCGT CCCCGGCGAT CCCTCCCGGG ACATCACCCT CACCCAGGAC
GTGCGGTTCG TCATGAAGGA CGGCCGTATC CACAAGGGTG CCTGA
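
A GC content figure like the one in the record can be derived directly from a sequence printed in this 10-base block layout. The following is a minimal sketch, assuming GC content means the percentage of G and C bases among all bases; the helper names are hypothetical. To keep the example short it is applied only to the first line of the gene sequence, so the printed value describes that line and not the whole gene.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the spaces and line breaks used for 10-base display blocks."""
    return "".join(blocked.split()).upper()


def gc_percent(blocked: str) -> float:
    """Percentage of G and C bases (assumed definition of GC content)."""
    seq = clean_sequence(blocked)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


first_line = "GTGACCGCGC CGGTCACGAT CCGGGCCGCC CGCTGGCTGG ACGTCGTCGC CGGCGAGGTC"
print(len(clean_sequence(first_line)))   # 60
print(round(gc_percent(first_line), 1))  # GC percentage of this line only
```

Passing the full gene sequence to `gc_percent` would give the whole-gene value to compare against the GC content field.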
 
Protein sequence
MTAPVTIRAA RWLDVVAGEV RSPAVIVVEG NRILAVNPAE TPAGAVEIEL GDVTLLPGLM 
DMELNLLIGG PDTPTGLPLP MHGVQDDPAY RTIRGTVNAR ATLLAGFTTV RNLGLMVKSG
GYLLDVAVQR AVEQGWVEGP QIIPAGHAIT PYGGHLDPTV FQRLAPGIMP LSIGEGIANG
VGEVRACVRY QIRHGAKVIK VSASGGVMSH STGPGAQQYS DEELAAIADE AHRADIRVAA
HAVGDRAVQA CVRAGIDCIE HGFLASDETL RMMADHGTFL VSTTYLTDAM DIARAAPELQ
RKAADVFPRA KAMLPRAIAA GVKIACGTDA PAVPHGDNAK ELAALVSRGM TPVQALRAAT
VTSAELVELD HELGQLRDGY LADIIAVPGD PSRDITLTQD VRFVMKDGRI HKGA
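
The relationship between the two sequences can be checked by translating the gene sequence. The sketch below is an illustration, not the database's own procedure: it builds the standard codon assignments (which translation table 11 shares) and, as a simplification, reads the first codon as methionine on the assumption that it is a valid initiator such as GTG. `translate_cds` and its `is_start` parameter are hypothetical names. Only the first 30 and last 15 bases of the gene sequence are used here.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 uses the same
# assignments and differs in which codons may act as initiators.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate_cds(dna: str, is_start: bool = True) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    If is_start is True, the first residue is reported as M, assuming
    the first codon is a valid initiator (a simplification).
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    if is_start and residues:
        residues[0] = "M"
    return "".join(residues)


print(translate_cds("GTGACCGCGC CGGTCACGAT CCGGGCCGCC"))   # MTAPVTIRAA
print(translate_cds("CACAAGGGTG CCTGA", is_start=False))   # HKGA
```

These fragments match the first ten and last four residues of the protein sequence.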