Gene Franean1_0583 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0583 
Symbol 
ID5669000 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp674962 
End bp676218 
Gene Length1257 bp 
Protein Length418 aa 
Translation table11 
GC content67% 
IMG OID641239510 
Productamidohydrolase 2 
Protein accessionYP_001504948 
Protein GI158312440 
COG category[R] General function prediction only 
COG ID[COG2159] Predicted metal-dependent hydrolase of the TIM-barrel fold 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.972073 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGTCG ACGACTTGAT TCTGGTGAGC ATCGACGACC ACGTGGTCGA ACCAGCGGAC 
ATGTTCAAGA ACCACCTGCC GGCGAACCTG GCCGACCAGG CGCCGCACGT CGAGACCGAC
GAGTCCGGCG TGGACCGGTG GATCTACCAG GGCCGGGTCA CCGGGGTCAG CGGCCTCAAC
GCCGTGATCA CCTGGCCGCC GGAGGAATGG GCCAAGGACC CGGCCGGCTT CGCCGAGATG
CGCCCCGCCG TCTACGACAT CCACGACCGG GTCCGGGACA TGGACCGTAA CGGGATCCTG
GCGTCGATGT GCTTCCCGAC GTTCGCCGGG TTCAGCGCCG GGCATCTCAA CCATTTCAAG
GATCCCCTCA CCGTCATAAT GATCCAGGCG TACAACGACT GGCACATCGA CGAGTGGGCC
GGCACCTACC CGGGCCGGTT CATCCCGCTG GCGCTGCTCC CGACCTGGGA CCCGCAGCTG
ATGGTGAACG AGATCCGCCG GGTGGCGGCG AAGGGCTGCC GGGCGGTCAC CATGCCCGAG
CTGCCGCACC TCGAGGGCCT GCCCAGCTAC CACAACCTCG ACTTCTGGGC TCCGGTGTTC
GAGGCGCTGT CCGACACCGG GATGGTGATG TGCCTGCACA TCGGAACCGG GTTCGGCGCG
CTCAAGCTCG CCCCGGACGC GCCGATCGAC AACCTGATCA TCCTGGCGTG CCAGATCTCC
TCGCTGGCCG TGCAGGACCT GTTGTGGGGC CCGGCGATGC GGACCTACCC GGACCTGAAG
TTCGCCTTCT CCGAGGGCGG CATCGGCTGG ATCCCGTTCT ACCTGGACCG CTGCGACCGG
CACTACACCA ACCAGCGCTG GCTGCGCCGC GACTTCGGCG GCAAGCTGCC CAGCGAGGTG
TTCCGCGACC ACTCACTCGC CTGCTACGTC ACCGACCCGA CGTCGCTGAA GCTGCGCCGT
GAGATCGGGA TCGACATCAT CGCCTGGGAG TGCGACTACC CGCACGCCGA CTCGATCTGG
CCCGAGGCGC CGGAGTTCGT GCTCAACGAG CTGAACAACG CCGGTGCGAC CGACGAGGAG
ATCGACAAGA TCACCTGGCG GAACGCCTGC CGGTTCTTCA ACTGGGACCC GTTCTCCGAG
ATCCCCAAGG AGCGCGCGAC CGTCGGCGCC CGTCGCGCGA TCGCGACCGA CGTCGACACC
ACCATCCGCT CCCGCAAGGA ATGGGCCCGC CTCTACGCGC AGCGACAGAC CACCTGA
 
Protein sequence
MNVDDLILVS IDDHVVEPAD MFKNHLPANL ADQAPHVETD ESGVDRWIYQ GRVTGVSGLN 
AVITWPPEEW AKDPAGFAEM RPAVYDIHDR VRDMDRNGIL ASMCFPTFAG FSAGHLNHFK
DPLTVIMIQA YNDWHIDEWA GTYPGRFIPL ALLPTWDPQL MVNEIRRVAA KGCRAVTMPE
LPHLEGLPSY HNLDFWAPVF EALSDTGMVM CLHIGTGFGA LKLAPDAPID NLIILACQIS
SLAVQDLLWG PAMRTYPDLK FAFSEGGIGW IPFYLDRCDR HYTNQRWLRR DFGGKLPSEV
FRDHSLACYV TDPTSLKLRR EIGIDIIAWE CDYPHADSIW PEAPEFVLNE LNNAGATDEE
IDKITWRNAC RFFNWDPFSE IPKERATVGA RRAIATDVDT TIRSRKEWAR LYAQRQTT