Gene Franean1_1564 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1564 
Symbol 
ID5669967 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp1870625 
End bp1871800 
Gene Length1176 bp 
Protein Length391 aa 
Translation table11 
GC content66% 
IMG OID641240483 
Productamidohydrolase 2 
Protein accessionYP_001505909 
Protein GI158313401 
COG category[R] General function prediction only 
COG ID[COG2159] Predicted metal-dependent hydrolase of the TIM-barrel fold 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.302749 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.45285 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGTCAAGT TGGTCGAAGG TTTACGAGTA GTTGACGCCG ACTCCCACAT GACCGAGCGC 
CATGACCTGT TCACCGAGCG GGCCCCGAAG GGCTACGAGG ACAAGGTCCC GCACGTCCAG
CGGATCAACG GCCAGGACAT GTGGGTCGTC GCCGGCAAGT CCTTCGGCCG CGCGGGCTCC
GGTGGAACGA TCGACCACGA CGGCAAGAAG CACCCGTGGA AGGACTCTCA GGGCGGGTCC
TGGGGCATCG AGAGCGTCCA CCCCGCGGCG TGGGACGCCG GCCGGCGGAT CACCCTGATG
GACGAGCTCG GCATCGACAC CCAGGTGGTC TACCCGAACG CCATCGGCAT CGGCGGCCAG
AACCTGTTCA ACGCGGTCGA CGACCCCACG GTCGTCCGGC TCTGCGTGGA GCTCTACAAC
GACGCGATGG CGGAGGTCCA GGCGGAGTCG GGCAACCGGC TGCTCCCCAT GCCGATCATG
CCAGCGTGGG ACATCCAGGG CTGTGTGCGC GAGGCGCAGC GCTGCGCGGA GATGGGCTAC
CGCGGGGTCA ACATGACCGC CGACCCGCAG GACTCCGGCT CACCGGACCT GGGCGACCCG
GCGTGGGACC CGTTCTGGGA GGTCTGTGCC GGGCTGAACC TGCCGGTGCA CTTCCACATC
GGCGCCAGCC AGACCTCGCT GTCCTACTTC GGCACGACCT ACTGGCCGAG CCAGGACGAC
TACGTGAAGC CGGCGATCGG CGGTGCGTCG CTGTTCCAGA ACAACTCCCG GCTGCTGCTC
AACAGCTGCT ACTCGGGAAT GTTCGACCGC CATCCGAACC TGAAGATGGT CTCGGTCGAG
AGCGGCATCG GCTGGATCCC CTTCATGCTC GAGGCGATGG ACTACGAGCT CGAGGAGAAC
GCGCCGGAGT ACTTCCGCAA GCTGCAGAAG CTGCCGTCGG AATACTTCGC GTCGAACTGG
TACGCGACCT TCTGGTTCGA GAAGGGCCGC GGCGACCTCC AGCATCTCGT CGACACCGTC
GGCGAGGACA ACATCATGTT CGAGACGGAC TTCCCGCACC CGACGAGCCT GCACCCGAAC
CCGCTCGAGA TGGTCACCGA GCAGGTCGGC GCGCTGCGCC CGGAGACGCA GCGCAAGATC
ATGGGTGAGA ACGCCACCAA GCTCTACCGC GTCTGA
 
Protein sequence
MVKLVEGLRV VDADSHMTER HDLFTERAPK GYEDKVPHVQ RINGQDMWVV AGKSFGRAGS 
GGTIDHDGKK HPWKDSQGGS WGIESVHPAA WDAGRRITLM DELGIDTQVV YPNAIGIGGQ
NLFNAVDDPT VVRLCVELYN DAMAEVQAES GNRLLPMPIM PAWDIQGCVR EAQRCAEMGY
RGVNMTADPQ DSGSPDLGDP AWDPFWEVCA GLNLPVHFHI GASQTSLSYF GTTYWPSQDD
YVKPAIGGAS LFQNNSRLLL NSCYSGMFDR HPNLKMVSVE SGIGWIPFML EAMDYELEEN
APEYFRKLQK LPSEYFASNW YATFWFEKGR GDLQHLVDTV GEDNIMFETD FPHPTSLHPN
PLEMVTEQVG ALRPETQRKI MGENATKLYR V