Gene Franean1_2692 details


Gene Information

Locus tag: Franean1_2692
Symbol:
ID: 5671083
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 3185301
End bp: 3186479
Gene Length: 1179 bp
Protein Length: 392 aa
Translation table: 11
GC content: 70%
IMG OID: 641241604
Product: amidohydrolase 2
Protein accession: YP_001507024
Protein GI: 158314516
COG category: [R] General function prediction only
COG ID: [COG2159] Predicted metal-dependent hydrolase of the TIM-barrel fold
TIGRFAM ID:
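
The coordinates and lengths in the record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the gene record.
start_bp = 3185301
end_bp = 3186479
gene_length_bp = 1179
protein_length_aa = 392

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span_bp = end_bp - start_bp + 1

# Assumption: the unspliced CDS ends with one stop codon that is not translated.
codon_count = span_bp // 3
expected_protein_aa = codon_count - 1

print(span_bp, expected_protein_aa)  # 1179 392
```

Under these assumptions, both derived values agree with the Gene Length and Protein Length fields.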


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACTGACC AACGCCCCAT CGACGCCGAC AACCACTACT ACGAGCCGCT GGACGCGTTC 
ACCCGCCACC TCGACCCGGC GTTCACCCAA CGCGGCGTGC AGGTCCTGCA GAAGGGCAAG
CGCGTCGTCG TGGTGATCGG CGGCCGGGTC AACACCTTCA TCCCCAACCC CACCTTCAAT
CCGGTGACCA AGCCTGGCTG CCTCGACCTG TACTTCCGCG GGGTCATGCC CGAGGGGGTG
AGCCGGCGGA CCCTGATGGA GGTCGAACCC CTGGCACCGG AGTACCGCGA CCGCGACGTG
CGGATAGCCC GACTCGACGA GCAGGGCCTG GCCGGCGCGG TGCTGTACCC GACGATGGGC
GTCGGAGTCG AGGAAGCGCT GCGTGACGAC GTCCCGGCGA CCATGGCCAG CCTGCACGCG
TTCAACCGGT GGCTGGAGGA CGACTGGGGC TACTCCTATC AGGACCGCCT GTTCGCCGTG
CCGCTGATCT CCCTGGCTGA TCCGCAGGCA GCGGTCGCCG AGGTGGAGCG GGTGCTCGGC
CTCGGTGCGC GCATCGTCCA CGTCCGCCCC GCACCCGTGC CCGCACCGGG GACGGGAACC
AGCGGGCGGT CGCTGGGCCA TCCCGCGCAC GACCCGGTGT GGGCGCGCCT CGCGGAGGCG
GACGTACCGG TGGCGTTCCA CCTGGGGGAC AGCGGCTACC ACCGGATATC GGCGATGTGG
GGCGGTTCGG CGACCCTGGA GGCGTTCGGG AAGACGAACG TCCTCGCCAA GATCGTCGTC
GGGGAGCGGG CCATCCAGGA CACGATGGCC AGCCTCGTCG TCGACGGCGT GTTCGCCCGC
CACCCGCGGC TGCGGGCGGT GAGCATCGAG AACGGCTCGT CCTGGGTGAA GCCGCTGCTG
CGGCTGATGA AGAAGTACGC CAACCAGTCG CCGGAGAGCT TCTCCGGCAA CCCGGTCGAA
GCGTTCACCG AGCACGTGTG GGTGGCGCCC TACTACGAGG ACGACATCGC CGGGCTGGTC
GAGCTCATCG GCGCCGACCA CGTCCTGTTC GGATCGGACT GGCCGCACGC CGAGGGCCTG
GCCGAACCGC TCCAGTTCGA CAAGGAGATC GAGTGCTTCG ACGCCACCAC CAAGGCTCGG
ATCATGCGGG GCAACTCCGC CGCGCTCCTC GGGCTGTGA
 
Protein sequence
MTDQRPIDAD NHYYEPLDAF TRHLDPAFTQ RGVQVLQKGK RVVVVIGGRV NTFIPNPTFN 
PVTKPGCLDL YFRGVMPEGV SRRTLMEVEP LAPEYRDRDV RIARLDEQGL AGAVLYPTMG
VGVEEALRDD VPATMASLHA FNRWLEDDWG YSYQDRLFAV PLISLADPQA AVAEVERVLG
LGARIVHVRP APVPAPGTGT SGRSLGHPAH DPVWARLAEA DVPVAFHLGD SGYHRISAMW
GGSATLEAFG KTNVLAKIVV GERAIQDTMA SLVVDGVFAR HPRLRAVSIE NGSSWVKPLL
RLMKKYANQS PESFSGNPVE AFTEHVWVAP YYEDDIAGLV ELIGADHVLF GSDWPHAEGL
AEPLQFDKEI ECFDATTKAR IMRGNSAALL GL
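
The gene sequence begins with GTG while the protein sequence begins with M. This is consistent with translation table 11, where GTG can act as a start codon and is read as methionine at the initiation position. The following is a minimal sketch of that translation using only the Python standard library. The function name `translate_cds` is hypothetical, and the code makes a simplifying assumption: the input starts exactly at the start codon, so the first codon is always reported as M.

```python
# Codon table in the usual TCAG ordering. Table 11 uses the same
# amino acid assignments as the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate_cds(dna: str) -> str:
    """Translate a CDS given as text; spaces and line breaks are ignored."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    residues = [CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3)]
    if residues:
        # Assumption: the input starts at the initiator codon, which is
        # read as Met even when it is an alternative start such as GTG.
        residues[0] = "M"
    # Stop at the first stop codon, which is not part of the protein.
    return "".join(residues).split("*")[0]


# First 30 bases of the gene sequence, in the 10-base grouping used by the record.
first_block = "GTGACTGACC AACGCCCCAT CGACGCCGAC"
print(translate_cds(first_block))  # MTDQRPIDAD
```

The output matches the first ten residues of the protein sequence. Passing the complete gene sequence to the same function gives a translation that can be compared against the full protein sequence.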