Gene Franean1_4239 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4239 
Symbol 
ID5672594 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5046789 
End bp5048033 
Gene Length1245 bp 
Protein Length414 aa 
Translation table11 
GC content76% 
IMG OID641243112 
Productamidohydrolase 
Protein accessionYP_001508529 
Protein GI158316021 
COG category[R] General function prediction only 
COG ID[COG1473] Metal-dependent amidase/aminoacylase/carboxypeptidase 
TIGRFAM ID[TIGR01891] amidohydrolase 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.943042 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCGCCA CGTCTGCCTC ACCCACCGCG CGCACCGGGC AGGCCGGGCC CGCGATCGAG 
TCCGTGCTGG AGAGCATCTC GGCCGCGCTG GCCGTGCTGC GGCCCCGGAT GGACGCGGTG
AGCCTGGCCA TCCACGCCCG GCCGGAGCTG AAGTTCGCCG AGTTTCACGC CCGGGACGTG
CTGACCGGCT GGCTCGGGGA GTCCGGTTTC ACCGTCCGGG TGCCGGCGGG CGGCCTGGAC
ACCGCCTTCG TGGCCGTGCA CGAGGGGGCG GAGCCCGGCC CGTGCGTCGC CGTCCTCGCA
GAGTACGACG CGCTGCCTGG TGTCGGGCAC GGCTGCGGGC ACAACCTCAT CGCGGCCGGG
GGTGCGGGCG CGGCGATCGC GGCCGTCCGC GCGCTGCCCG CCCACCCCGG CACTATCGCC
GTCATCGGCA CGCCCGGTGA GGAGATGGGC GGCGCGGGCA AGATCCGGCT CGCCGAGGCC
GGGGTCTTCG ACGGCGTCGA CGCGGCGGTG ATGTTCCATC CCGGCGACCG GTCGCTGACC
GGCCGGCCCG GGCTGGCCGC GGCCCACCTG CGGGTCGCGT TCGCGGGGAC GAGCGCGCAC
GCGGCCCTCT CGCCCTGGTC GGGGCGCAGC GCGCTGGCGG GAGCCCAGCT GTTCCTCAAC
GCACTCGACA CGATGCGCCA GTTCGTCCCG CCGAGCGCCC GGCTGCACGG CATCATCTCG
GACGGCGGCC AGGCCCCGAA CGTCGTCCCG GCCCACGCCG CGGTGGACCT GTACGTCCGG
GACGGCACGG CAGCCTCGGT CGAGGAACTG GTCGAGCGGG TCCGCGCGGC GGCCGCGGGC
GCGGCGCTCG CCACCGGGAC GGCGGCGGAG GTCACCGAGA CCGGCCCGCT GTATGCGGAG
CGCCGCGACA ACACGGTGCT CGCCGAGCGG TTCGCGGCGG CGGTGCGCGC GCTGGGTGTG
GACATCGCGC CCGGTGACCC GCGCGGCCCC GCCGGCTCCT CCGACATCGG CAACCTCTCC
CAGCTGCTGC CGGTCATCCA CCCGTACATC CAGATCGCCG AGGTCGGTAC GCCCGGTCAC
TCCGACGCAC TGCGCGAGGC GGCGGCCACG GCGTTCGCCC ACGACCGCAC CCAGGTCGCG
GCGGCAGGGC TGGCCTGGGT GGTCACCGGC CTGCTCACCG AGCCGGGCCT GCTGGCGGCG
GCACGGGCGG AGTTCACGAC GGTGTCCACG GATGGCACGG ACTGA
 
Protein sequence
MPATSASPTA RTGQAGPAIE SVLESISAAL AVLRPRMDAV SLAIHARPEL KFAEFHARDV 
LTGWLGESGF TVRVPAGGLD TAFVAVHEGA EPGPCVAVLA EYDALPGVGH GCGHNLIAAG
GAGAAIAAVR ALPAHPGTIA VIGTPGEEMG GAGKIRLAEA GVFDGVDAAV MFHPGDRSLT
GRPGLAAAHL RVAFAGTSAH AALSPWSGRS ALAGAQLFLN ALDTMRQFVP PSARLHGIIS
DGGQAPNVVP AHAAVDLYVR DGTAASVEEL VERVRAAAAG AALATGTAAE VTETGPLYAE
RRDNTVLAER FAAAVRALGV DIAPGDPRGP AGSSDIGNLS QLLPVIHPYI QIAEVGTPGH
SDALREAAAT AFAHDRTQVA AAGLAWVVTG LLTEPGLLAA ARAEFTTVST DGTD