Gene Franean1_3308 details

Gene Information

Locus tag: Franean1_3308
Symbol:
ID: 5671680
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 3919551
End bp: 3920810
Gene Length: 1260 bp
Protein Length: 419 aa
Translation table: 11
GC content: 71%
IMG OID: 641242197
Product: amidohydrolase
Protein accession: YP_001507617
Protein GI: 158315109
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG1228] Imidazolonepropionase and related amidohydrolases
TIGRFAM ID:
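
The coordinates and lengths listed above are mutually consistent, which a little arithmetic confirms. The following is a minimal Python sketch, assuming the start and end positions are inclusive and that the final codon of the CDS is a stop codon not counted in the protein length. The function names are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(3919551, 3920810)
print(length)                  # 1260
print(protein_length(length))  # 419
```

Both values match the Gene Length and Protein Length fields of this record.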


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.692035
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.417277
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACCGGCG TCCCCGGTGC TACCGGCGCG ACCGTTCTGC GCGCGGCACG CTGGGTCGAC 
GTCGACGCGG GGACGGTGCG CTCGCCCGCG GTCGTGGTGG TCGAGGGGAA CCGCATCACC
GCGGTGAACC CGGCCGTGCC GCCCACCGGC GCGACCGAGA TCGATCTGGG TGCTCTCACT
CTGCTGCCGG GCCTGATGGA CATGGAGATC AATCTCCTCC TCGGTGGCCC CGAGAACCCG
ACGGGCCTGC CGAACCCGCT GCACGGCGTC CAGGACGACC CGGTGTACCG GACCCTGCGG
GCGACGGTGA ACGCCCGCAC CACGCTGCTG GCCGGTTTCA CGACGGTGCG CAACCTCGGG
CTGATGGTCA AGACCGGGGG GTATCTGCTG GACGTGGACC TGGCCCGGGC GATCGAAGCG
GGCTGGGTAC CGGGGCCCCG GATCGTGGCC GCCGGTCACG CGATCACGCC CACCGGCGGG
CACCTCGACC CGACGATGTT CCAGCGGCTC GCGCCGCACA TCATGCCGCT GGGCGTCGAG
GAGGGGATCG CCAACGGCGT CCCGCAGGTG CGCGCGGCGG TCCGCTACCA GATCAAGTAC
GGCGCCGGGG TCATCAAGAT CTCGGCGTCG GGCGGGGTGA TGTCGCACAG CACCGCCGCC
GGCGCGCAGC AGTACTCCGA CGAGGAGATC GCGGCCATCG TCGACGAGGC CCACCGGGCG
GGGCTCAAGG TGGCTGCCCA CGCCCACGGC GACGCGGGCA TCCGGGCCTG TGTCCGGGCC
GGGGTGGACT GCATCGAGCA CGGCTCACTG GCCAGCGACG ACACCATCCG GATGATGGTC
GACCATGGGA CTTTCCTCGT CCCTACCAGC TATCTGTCGG AAGGCCTCGA CATCTCGAAG
GCGGCGCCCG CGCTCCAGGC GAAGGCCGCG GAGGTCTTCC CCCGGGCTCG GCGGACGCTG
GGTAGGGCCA TCGAGGCCGG GGTGCGGATC GCGTGTGGCA CCGACGCGCC CGCCATCCCG
CACGGGCACA ACGCGAAGGA GCTGTGGGCT CTGGTCGACC GCGGCATGAC CGCGATGCAG
GCGCTGCGGG CCGCCACGGT CACCAGCGCC GAGCTGATCG GTGTCGATGA CCGCGGTCGC
CTGGCGGCTG GTCTGCTGGC CGACATCATC GCGGTTCCCG GAGATCCATC CGATGACATC
ACGGCCACGC AGGACGTGCG GTTCGTGATG AAGGACGGCC TCGTCTACAA GAACGAGTAG
 
Protein sequence
MTGVPGATGA TVLRAARWVD VDAGTVRSPA VVVVEGNRIT AVNPAVPPTG ATEIDLGALT 
LLPGLMDMEI NLLLGGPENP TGLPNPLHGV QDDPVYRTLR ATVNARTTLL AGFTTVRNLG
LMVKTGGYLL DVDLARAIEA GWVPGPRIVA AGHAITPTGG HLDPTMFQRL APHIMPLGVE
EGIANGVPQV RAAVRYQIKY GAGVIKISAS GGVMSHSTAA GAQQYSDEEI AAIVDEAHRA
GLKVAAHAHG DAGIRACVRA GVDCIEHGSL ASDDTIRMMV DHGTFLVPTS YLSEGLDISK
AAPALQAKAA EVFPRARRTL GRAIEAGVRI ACGTDAPAIP HGHNAKELWA LVDRGMTAMQ
ALRAATVTSA ELIGVDDRGR LAAGLLADII AVPGDPSDDI TATQDVRFVM KDGLVYKNE
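
The sequences above are printed in space-separated blocks of ten characters, so the whitespace has to be removed before computing anything from them. The sketch below shows one way to do that and to derive the GC fraction and the codon list. The helper names are hypothetical, and the demonstration uses only the first two blocks of the gene sequence rather than the full record. Note that the gene starts with GTG, an alternative start codon in translation table 11, which is why the protein sequence still begins with M.

```python
def clean(seq: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def codons(seq: str) -> list[str]:
    """Split a sequence into complete codons, dropping any trailing partial codon."""
    s = clean(seq)
    usable = len(s) - len(s) % 3
    return [s[i:i + 3] for i in range(0, usable, 3)]


# First two blocks of the gene sequence shown above.
first_blocks = "GTGACCGGCG TCCCCGGTGC"
print(len(clean(first_blocks)))   # 20
print(gc_fraction(first_blocks))  # 0.8
print(codons(first_blocks)[0])    # GTG
```

Applying `gc_fraction` to the complete gene sequence would give the value to compare against the GC content field of this record.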