Gene Franean1_4292 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4292 
Symbol 
ID5672647 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5129980 
End bp5131188 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content71% 
IMG OID641243165 
Productamidohydrolase 
Protein accessionYP_001508582 
Protein GI158316074 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1228] Imidazolonepropionase and related amidohydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.756284 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCTCACGA TCAGGGCTGC CGGGCTTGTC GACGTCGACC TCGGTGAGGT CGTCAGGCCC 
GGCATCCTCA AGATCGACGG AGACCGGATC GTCGGTGTCG GTGGCTCGCC GGAGGGCGAG
ATCATCGACC TCGGCGATCT GGTCCTCCTG CCCGGCCTCA TGGACATGGA GGTCAACCTG
CTGATGGGCG GCCGCGGCGA GCATGCCGTG ACCTCTCCCG TCCGGGACGA CCCCCCGCTG
CGGATGATGC GCGCCGTCGG CAACGCCCGC CGGACGTTGC GGGCGGGGTT CACCACCGTC
CGCAACCTCG GCCTGTTCTG CAAGACCGGC GGCTACCTGC TCGACGTCGC GCTGATGAAG
GCGATCGACG CGGGCTGGGT CGACGGCCCG CGGATCGTGC CGGCCGGGCA CGCGATCACC
CCGACCGGCG GCCACCTCGA CCCGACGATG TTCGGCGCGT TCGCGCCGCA CGTCCTCGAC
CTGACGGTCG AGGAGGGCAT CGCCAACGGC GTCGCCGAGG TCCGCAGGGC GGTGCGCTAC
CAGATCAAGC ACGGCGCGCA GCTGATCAAG GTGTGCGCGT CGGGTGGGGT CATGTCGCAC
ACCGGCCTGC CCGGGGCGCA GCACTACTCC GACGAGGAAC TGCGCGCGAT CGTCGACGAG
GCGCACCGCC ACGGCCTGCG GGTCGCCGCG CACACCCACG GCGCCCAGGC GGTGCGCTCG
GCCGTCGAGG CGGGTATCGA CTGCATCGAG CACGGTTTCC TCATCGACGA CGAGGCGATC
GAGCTCATGG TCAAGCACGG GACGTTCCTC GTGGCGACCC AGGCCCTGAC CGAGGGCATG
GACGTCTCCC ACGCGCCGCC CGAGCTGAGG GAGAAGGCGG GCCAGATCTT CCCCCGGGCC
CGCAACTCGA TCCGGGAGGC GATGGCCGCC GGAGTTAAGA TCGCCGTCGG TACCGACGCC
CCGGCGATCC CGCACGGCAG GAACGCGATC GAGCTGGTGA CCCTGGTCGA ACGCGGCATG
ACCCCGCTCG GCGCGATCCG GGCGGCGACC ACCACCGCGG CCGATCTGCT GGCCGTCACC
GACCGGGGCC GACTCGCCGA GGGCCTGCTG GCCGACGTCA TCGCCGTCGC CGGTGACCCC
CTGCAGGACA TCAGCACGCT GCAGAACGTG AAATTCGTGA TGAAGGGCGG CAAGACCTTT
GTCCACTGA
 
Protein sequence
MLTIRAAGLV DVDLGEVVRP GILKIDGDRI VGVGGSPEGE IIDLGDLVLL PGLMDMEVNL 
LMGGRGEHAV TSPVRDDPPL RMMRAVGNAR RTLRAGFTTV RNLGLFCKTG GYLLDVALMK
AIDAGWVDGP RIVPAGHAIT PTGGHLDPTM FGAFAPHVLD LTVEEGIANG VAEVRRAVRY
QIKHGAQLIK VCASGGVMSH TGLPGAQHYS DEELRAIVDE AHRHGLRVAA HTHGAQAVRS
AVEAGIDCIE HGFLIDDEAI ELMVKHGTFL VATQALTEGM DVSHAPPELR EKAGQIFPRA
RNSIREAMAA GVKIAVGTDA PAIPHGRNAI ELVTLVERGM TPLGAIRAAT TTAADLLAVT
DRGRLAEGLL ADVIAVAGDP LQDISTLQNV KFVMKGGKTF VH