Gene Franean1_3688 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3688 
Symbol 
ID5672054 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4365126 
End bp4366310 
Gene Length1185 bp 
Protein Length394 aa 
Translation table11 
GC content69% 
IMG OID641242571 
Productamidohydrolase 2 
Protein accessionYP_001507991 
Protein GI158315483 
COG category[R] General function prediction only 
COG ID[COG2159] Predicted metal-dependent hydrolase of the TIM-barrel fold 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCACCC ACGACATTCC GGTGTTCGAC GCCGACAACC ACCTGTACGA GACCCAGGAC 
GCGCTCACCA AGTTCCTGCC GGCGCGCTAC CGGGGCGCGA TCGACTACGT CGACGTGCAC
GGCCGGACGA AGATCGTCGT GCGCGGGCAG ATCAGCCAGT ACATCCCGAA CCCGACCTTC
GAGGTCGTCG CCCGCCCGGG GGCGCAGGAG GACTACTACC GGCACGGCAA CCCCGAGGGG
AAGACGTACC GGGAGATCTT CGGCAAGCCG GTGCGGTCCA TCGACGCCTG GCGGGAGCCG
GCCGCGCGCA TCAAGGTCAT GGACGAGCAG GGGCTCGACC GCACCCTGAT GTTCCCCACG
CTCGCCAGCC TGATCGAGGA GCGGATGCGC GACGACGCGG ACCTCGTCCA CGCCGTCATC
CACTCGCTCA ACGAGTGGCT GTACGAGACC TGGCAGTTCA ACTACCAGGA CCGGATCTTC
ACCACGCCGG TGATCACCCT GCCGATCGTG GAGAGGGCCG TCGAGGAGCT GGAGTGGGTC
CTCGAGCGGG GCGCCCGGGT CATCCTCGTC CGGCCGGCGC CCGTGCCGGG CCTGCGCGGG
CCGCGCTCGT TCGGCCTGCC GGAGTTCGAC CCGTTCTGGG CCCGCGTGCA GGAGGCCGAC
ATCCTCGTCG CACTGCACTC GTCCGACAGC GGGTACGCCC GCTACAGCGG CGAGTGGATG
GGCGCCAACC GCGAGATGCT GCCGTTCCAG CCGAACCCGT TCCAGATGCT GCAGGCATGG
CGGCCGGTCG AGGACGCGGT TTCGGCGCTC GTCTGCCACG GCGCGCTCTC CCGCTTCCCC
CGGCTGAAGG TGGCCGTCGT CGAGAACGGG ATGAGCTGGG TCGCCCCGCT GATGGACGCC
ATGAAGAACC TGTACAAGAA GATGCCGCAC GACTTTCCCG AGAACCCGCT CGACGTGATC
CGGCGCAACG TCTACGTCAG CCCGTTCTGG GAGGAGGACC TCGGCGCGCT GACCAAGGTC
CTCGGTGAGG ACCACGTGCT GTTCGGGTCC GACTATCCGC ATCCGGAGGG GCTGGCGAAC
CCGGTCAGCT ACATCGACGA GCTCGCCCAC CTGCCGGAAC CGCTCGTGCG CAAGCTCATG
GGCGGCAACC TCGCCCAGCT CATGAAGGTC CCGGCCGCGG TCTGA
 
Protein sequence
MPTHDIPVFD ADNHLYETQD ALTKFLPARY RGAIDYVDVH GRTKIVVRGQ ISQYIPNPTF 
EVVARPGAQE DYYRHGNPEG KTYREIFGKP VRSIDAWREP AARIKVMDEQ GLDRTLMFPT
LASLIEERMR DDADLVHAVI HSLNEWLYET WQFNYQDRIF TTPVITLPIV ERAVEELEWV
LERGARVILV RPAPVPGLRG PRSFGLPEFD PFWARVQEAD ILVALHSSDS GYARYSGEWM
GANREMLPFQ PNPFQMLQAW RPVEDAVSAL VCHGALSRFP RLKVAVVENG MSWVAPLMDA
MKNLYKKMPH DFPENPLDVI RRNVYVSPFW EEDLGALTKV LGEDHVLFGS DYPHPEGLAN
PVSYIDELAH LPEPLVRKLM GGNLAQLMKV PAAV