Gene Franean1_0715 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0715 
Symbol 
ID5669131 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp831541 
End bp832806 
Gene Length1266 bp 
Protein Length421 aa 
Translation table11 
GC content67% 
IMG OID641239642 
Productamidohydrolase 2 
Protein accessionYP_001505079 
Protein GI158312571 
COG category[R] General function prediction only 
COG ID[COG2159] Predicted metal-dependent hydrolase of the TIM-barrel fold 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.548307 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.202606 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACATGG ACGACATGAT TCTGCTGAGC ATCGACGACC ACGTGATCGA GCCGCCGGAC 
ATGTACAAGA ACCATGTCCC GGCGAAGTGG CTCGATTCCG TGCCGAAGGT CGTCCGGAAC
GAGGCCGGCG TCGACGAGTG GGTGTTCCAG GGCGAGAAGA CGTCCACACC GTTCGGTATG
GCGGCGACCG TCGGCTGGCA CCGGGAGGAG TGGGGATTCA ACCCCGGCGC CTTCACCGAG
TTACGTCCGG GCTGTTTCGA GGTCCACCAG CGGGTCCGCG ACATGAACGC CAACGGTGTC
CTCGCCTCGA TGTGCTTCCC GACGATGGCG GGCTTCAACG CCCGCACGTT CTCCGAGGCC
CTCGACAAGG ACCTCTCGCT CATCATGCTG CAGGCCTACA ACGACTGGCA CATCGACGAG
TGGTGCGGCG CCTACCCGGG CCGGTTCATC CCCCTCGGCA TCGTGCCGAT GTGGGACGTC
GAGCTCGCGG TGAAGGAGAT CCGGCGGATC GCCGCGAAGG GCTGCCGCTC CATCAGCTTC
CTGGAGGCCC CCCACGCGCA GGGCTGGCCG AGCTTCCTCT CCGGCCACTG GGACCCGATG
CTGCAGGCCC TCGTCGACGA GAACATGGTG CTCAGCCTGC ACATCGGCGG CGCCTGGGAC
ATCGTCAAGC TCGCCCCCGA GGTGCCGATC GACCACATGA TCGTCATTCC GTCCCAGCTC
ACCATGCTCA CCGCGCAGGA CCTGCTCTTC GGCCCGACAC TGCGGCGCTT CCCCGAGCTG
AAGGTGGCCC TCTCCGAGGG TGGCATCGGC TGGATCCCGT TCTACCTGGA CCGCGTCGAC
CGGCACTTCC AGAACCAGAG CTGGATCCAC AACGACTTCG GCGGCAAGCT GCCCTCCGAG
GTGTTCCGGG AGCACTTCCT GGCCTGTTAC ATCACCGACC CGGCCGGGCT GCGCCTGCGC
GAGCAGATCG GCATCGAGAC CATCGCCTGG GAGTGCGACT ACCCGCACAC CGACACGACC
TGGCCCGAGT CACCCGAGCA CGCCTGGAAC GAGCTCCAGC AGGCCGGCTG CCGCGACGAC
GAGATCCACC AGATCACCTG GGAGAACGCC AGCCGCTTCT TCGGCTGGGA CCCGTTCTCC
CACACGCCGA GGGAGCAGGC CACCGTGGGC GCGCTGCGCG GGCTGGCCGC CGATGTCGAC
GTCACCCGGA TGTCGCGCGA GGAGTGGCGC AAGCGCAACG AGGCCGCGGG AATCGGCGTC
TTCTAA
 
Protein sequence
MHMDDMILLS IDDHVIEPPD MYKNHVPAKW LDSVPKVVRN EAGVDEWVFQ GEKTSTPFGM 
AATVGWHREE WGFNPGAFTE LRPGCFEVHQ RVRDMNANGV LASMCFPTMA GFNARTFSEA
LDKDLSLIML QAYNDWHIDE WCGAYPGRFI PLGIVPMWDV ELAVKEIRRI AAKGCRSISF
LEAPHAQGWP SFLSGHWDPM LQALVDENMV LSLHIGGAWD IVKLAPEVPI DHMIVIPSQL
TMLTAQDLLF GPTLRRFPEL KVALSEGGIG WIPFYLDRVD RHFQNQSWIH NDFGGKLPSE
VFREHFLACY ITDPAGLRLR EQIGIETIAW ECDYPHTDTT WPESPEHAWN ELQQAGCRDD
EIHQITWENA SRFFGWDPFS HTPREQATVG ALRGLAADVD VTRMSREEWR KRNEAAGIGV
F