Gene Franean1_3134 details


Gene Information

Locus tag: Franean1_3134
Symbol:
ID: 5671512
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 3688413
End bp: 3689612
Gene Length: 1200 bp
Protein Length: 399 aa
Translation table: 11
GC content: 67%
IMG OID: 641242031
Product: amidohydrolase
Protein accession: YP_001507451
Protein GI: 158314943
COG category: [R] General function prediction only
COG ID: [COG3964] Predicted amidohydrolase
TIGRFAM ID:
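
The coordinate and length fields above are mutually consistent, which a short check can illustrate. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (the record does not say so explicitly, but that convention reproduces the listed 1200 bp) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3688413
END_BP = 3689612


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming one trailing stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1200, matching "Gene Length"
print(protein_length(length_bp))  # 399, matching "Protein Length"
```

The subtraction of one codon accounts for the terminal TGA stop in the gene sequence, which has no counterpart in the protein sequence.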


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGACCA AGACTCACAA CACGGACAAG TCGGTGCGCT ACGACCTCGT CCTACGCGGC 
GGACGGGTGT TCGACACCAC AATCGCGCCG GCCCCGACCG TCTTGGACAT CGCCATCACC
GACGGACGGG TGGCCACCGT GGCGCCGCAC GTCGACGGTG TGGGAACGCG GGAGATCGAC
TGTACCGATC GCGTTGTGAC GCCGGGGCTG CTGGACGTGC ACGTCCACTG CTTCGAGGGA
ATGGCCATGA CCATTGGCAT GTCGTCCTAC GACGCGACCC TGCGTCGTGG CGTGGTCGGT
TGTGTCGACA CGGGAACGTC AGGAGCCTCC AACTTCCGTG GCTTCCGCCG CTTCGCGGTG
GGCGACAACG AGTTCCGCGT ACTGGCGTTC CTCAACGTCT CGGTGCTCGG AGTGACGGAC
AAGCGGCACG GCGAGTTACA GGACATCTCG GTCATCCATG TGGACGACGC GGTGAACGCC
GCCAAGGCGA ACCCTTCGAT CATCCGCGGC TTCAAGGTGC GGCTGTCCCG GAACATCGCG
TTGGAGCCGG CGAAGTCTCT GGACCTGGCC CGCGAGATCG CCGGCCTGGC GGGCCTGCCA
CTGATGGTCC ACATCAGCAA GACGGACATC AGCACCGACG ACATCCTGGC GCGGCTTGCC
CCCGGAGACG TCGTCACCCA CGCCTTCACC GGGCTCGAGG GAGGCATTGT CGAGAACGGC
TCGGTGCGAC CCGCGGCCTG GGAGGCCCGT GAACGGGGCG TGCTGTTCGA CATCGGTCAC
GGCCGCACCC AGTTCGACCA CGGGGTGGCC CGTATCGCGC TCGACGAAGG CTTCGTCCCC
GATTTCCTGG GTTCGGACCT CAGCAACGGC AACCAGTTCG GTCCGGCCTT CGATCTCCCG
ACCGTCATGG CCAAGATGGT CACCCTGGGG ATGCCGATTC AGGACGTGGT CGCGGCAACG
ACGCTCCGCG CCGCTGAGTT CCTCGGGCTG CGGGACGAGG GCTACGGCGC GATCACGGTG
GGCAGGCCGG CGTTCGTGAC CGTCATGGAG CATCTCGACC ACGTGGACTC GCTGCCGGAC
GCCTCTGGGG CGGAGCTCGA GGTCAGGCGA CTGGAGCCGC TGTTCGCCGT CAACAAGGGT
GTCGTCCACG ACTCCGATCC GTGGCGGGGC GGGCAGCCCG AGCCGCCGGC GGAGTGGTGA
 
Protein sequence
MTTKTHNTDK SVRYDLVLRG GRVFDTTIAP APTVLDIAIT DGRVATVAPH VDGVGTREID 
CTDRVVTPGL LDVHVHCFEG MAMTIGMSSY DATLRRGVVG CVDTGTSGAS NFRGFRRFAV
GDNEFRVLAF LNVSVLGVTD KRHGELQDIS VIHVDDAVNA AKANPSIIRG FKVRLSRNIA
LEPAKSLDLA REIAGLAGLP LMVHISKTDI STDDILARLA PGDVVTHAFT GLEGGIVENG
SVRPAAWEAR ERGVLFDIGH GRTQFDHGVA RIALDEGFVP DFLGSDLSNG NQFGPAFDLP
TVMAKMVTLG MPIQDVVAAT TLRAAEFLGL RDEGYGAITV GRPAFVTVME HLDHVDSLPD
ASGAELEVRR LEPLFAVNKG VVHDSDPWRG GQPEPPAEW
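
The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched in a few lines of Python. This is an illustrative sketch rather than the pipeline that produced the record: the helper names are hypothetical, the codon assignments are those of the standard genetic code (which translation table 11 shares), and the table 11 rule that alternative start codons such as GTG or TTG are read as methionine is not implemented, since this gene begins with ATG.

```python
# Codon table built from the NCBI ordering of bases (T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-base display blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three display blocks of the gene sequence above.
print(translate("ATGACGACCA AGACTCACAA CACGGACAAG"))  # MTTKTHNTDK
```

Passing the full gene sequence to `translate` and `gc_fraction` would allow a comparison against the protein sequence and the GC content field listed in this record.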