Gene Franean1_3313 details


Gene Information

Locus tag: Franean1_3313
Symbol:
ID: 5671685
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 3924929
End bp: 3926215
Gene Length: 1287 bp
Protein Length: 428 aa
Translation table: 11
GC content: 68%
IMG OID: 641242202
Product: amidohydrolase 2
Protein accession: YP_001507622
Protein GI: 158315114
COG category: [R] General function prediction only
COG ID: [COG2159] Predicted metal-dependent hydrolase of the TIM-barrel fold
TIGRFAM ID:
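
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon excluded from the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 3924929
end_bp = 3926215
gene_length_bp = 1287
protein_length_aa = 428

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A coding sequence should be a whole number of codons.
codons, remainder = divmod(span, 3)

# Assumption: the last codon is a stop codon and is not counted in the protein.
expected_protein_length = codons - 1

print("span matches gene length:", span == gene_length_bp)
print("span is a multiple of 3:", remainder == 0)
print("protein length matches:", expected_protein_length == protein_length_aa)
```

Under these assumptions all three checks agree with the record: the span is 1287 bp, which is 429 codons, leaving 428 residues once the stop codon is set aside.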


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.646816
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACATGG ACGACTTGAT CCTTATCAGC GTGGACGACC ACGTGATAGA GCCCCCCGAC 
ATGTTCGAGG GCTTCATTCC GGCGAAGTAC GCCGACCGGG CGCCCCGGCT CGTCTCGGAC
GAGCTGAGCG ACAAGTGGGT GTTCGGCGAA GGCGAGGCCC GCAGCTCCGG CCTGAACGCG
GTGGCCGGCC GCCCGCCCGA GGAGTACGGC CTGGAGCCGA CGCGGCTGGC GGAGATCCGA
CGGGGCTGCT ACGACGTCCA CGAGCGGGTC AAGGACATGA GCGCCAACGG CGTGCTCGCC
TCCCTGAACT TCCCGTCGAT GGCCCGCTTC TGCGGTCAGT TCTTCGCCAG CCGGGCCGAC
CAGGACCCCG ACCTGGCCCT TGCCGTCCTC ACCGCCTACA ACGACTGGCA CATCGACGCC
TGGTGCGGCG CGTATCCGGA CCGGTTCATC CCCTGCTCGA TCCCGCCGCT GTGGGACCCC
CAGCTGATGG CGAAGGAGAT CCGCCGGACG GCGGCCAAGG GCTCCCATGC GGTCAGCTTC
TCGATGAACC CCTACGCCCT CGGCCTCCCG TCGTTGCACA GCGATCACTG GGACCCGTTC
TGGGCGGCCT GCGAGGAGAC CGAGACGGTC GTGTGCGTGC ACATCGGGTC AGGCGCCATC
GGCGTGGTCA CGGCACCCGA CGCCCCGATG AACGTCGAGA TCACCTGCGC CGCCATCAAG
ACCTTCCCGA CCGCCGCGGA CCTCGTCTGG TCGCCCATCT TCCAGAAGTT CAAGAACCTC
AAGGTGGCCC TGTCGGAGGG CGGGATCGGC TGGATCCCGT ACTTCCTGGA GCGGGCCGAC
TACGCCTACA AGCAGCACCG CGCCTGGACC CGCCCCGAGC TCGGCGGCCG CCTGCCGAGC
GAGATCTTCC GTGACCATGT CGTCACGTGC TTCATCGTCG ACGACTTCGG TGTGGCCAAC
CTCGATCGGA TGAACGAGGA CATGGTCACC TGGGAGTGCG ACTACCCCCA CTCCGACAGC
ACCTGGCCGC GTTCACCCGA GGTGGTGATC GACGCCGTGG CCGGACTGAC CGACCTGCAG
GTGGACAAGA TCACTCATCG CAACGCGATG CAGGTGTACT CCTTCGCCCC CTTCTCGATC
CGTCCACGCG AGCGGTGCAC CGTCGGCGCG CTGCGGAAGG AGGCCACCGG ACACGACATC
TCGATCGTCT CGCAGGGCGT CTCGGAGCGC CGGTTGACCA CGGTCGGCCA GTTCGCCCAA
GCGCACCAGC CCGGCAGGAC GGCGTGA
 
Protein sequence
MNMDDLILIS VDDHVIEPPD MFEGFIPAKY ADRAPRLVSD ELSDKWVFGE GEARSSGLNA 
VAGRPPEEYG LEPTRLAEIR RGCYDVHERV KDMSANGVLA SLNFPSMARF CGQFFASRAD
QDPDLALAVL TAYNDWHIDA WCGAYPDRFI PCSIPPLWDP QLMAKEIRRT AAKGSHAVSF
SMNPYALGLP SLHSDHWDPF WAACEETETV VCVHIGSGAI GVVTAPDAPM NVEITCAAIK
TFPTAADLVW SPIFQKFKNL KVALSEGGIG WIPYFLERAD YAYKQHRAWT RPELGGRLPS
EIFRDHVVTC FIVDDFGVAN LDRMNEDMVT WECDYPHSDS TWPRSPEVVI DAVAGLTDLQ
VDKITHRNAM QVYSFAPFSI RPRERCTVGA LRKEATGHDI SIVSQGVSER RLTTVGQFAQ
AHQPGRTA
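
The sequences above are printed in blocks of ten characters, so they need to be joined before any length or composition check. The sketch below shows one way to do that; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first and last lines of the gene sequence are used here to keep the example short. Pasting the full gene sequence into `clean_sequence` would allow the reported gene length and GC content to be recomputed.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First and last lines of the gene sequence, copied from the record above.
first_line = "ATGAACATGG ACGACTTGAT CCTTATCAGC GTGGACGACC ACGTGATAGA GCCCCCCGAC"
last_line = "GCGCACCAGC CCGGCAGGAC GGCGTGA"

first = clean_sequence(first_line)
last = clean_sequence(last_line)

# The record lists translation table 11, in which TGA is a stop codon.
print("starts with ATG:", first.startswith("ATG"))
print("ends with TGA:", last.endswith("TGA"))
print("bases on first line:", len(first))
print("bases on last line:", len(last))
```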