Gene Franean1_6839 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_6839 
Symbol 
ID5675152 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp8338922 
End bp8340235 
Gene Length1314 bp 
Protein Length437 aa 
Translation table11 
GC content72% 
IMG OID641245688 
Productamidohydrolase 
Protein accessionYP_001511079 
Protein GI158318571 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1228] Imidazolonepropionase and related amidohydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00614522 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGAGC GCATCTTGAT CAGGAACGCG AAGATACTCA CCTGCTCCGC GCCGACTCCG 
CGGGCCGTGC CCGGCGTGCC CGGCGGCGGT TCCGGCGCCG GGGCCAACGT CGGCACCGGG
CCGGACGTCA TCGCCGACGG CGACCTCCTG ATCGAGGGCG ACCGGATCGC GCGGGTGCGG
GCGGGGCGGA TCGAGGTCGA CTCCGGCTCG GCCCGCGTCA TCGACCTGCA CGGGGCGGCC
GTCTTACCCG GCCTGGGAGA CGCGCACGTG CACATGAGCT GGCCGCTCGA CTTCGTCTTC
GACCACGTCT CCGTCGCGAA CGCGCCGGCC GCGCCGCACG CGCTCGACGT CGCCGCCGTG
GCCCGGACGT TCCTGGAGAG CGGCTACACG CTCGTCGTCG GGGCAGGGGT CTCCCAGCCG
TTCGACGACG TGCGCACCAG GGACGCGATC GAGCGGGGCC TCATCCCCGG CCCGCGGGTC
ATCCCCAGCG GCACGATGAT CACCGAGCGG GGTGCGATCA GCGCGGACAC CGGGATGACC
TCGGTCGTCT CCGACGCCCG GGACCTCCGC GAGGTCGTCG CCCGCCAGTG CGACACCGGC
GTCCGGGCGT TGAAACTGTT CGTCTCCGGG GACGGCATCG TCCCCGAGTA CCCCTCCGAC
GACCTCTACA TGAACGACGA GATGCTGTCC GCGGCCGTCG ACGAGGCCGA CCGGTACGGC
GCGTTCATCA CCGTGCACGC CCGCGGGTCG GACAGCGTCG CGATGGCGGC GCGCAACGGG
GCCCGGGTCA TCCACCACGC CTGCTTCCTC GACGACAAGG CGGTGCACGA GCTGGAGGCC
CGCCGGGACG ACGTCTGGGT GTGCCCCGGC CTGCACTACC TGTACGCGAT GGTCAGCGGC
CACGCCGAGC CCTGGGGCGT CACCCCGGAG AAGATCGAGC GGTCGGGCTA CGAGAAGGAG
TTCCGCGCCC AGGTCGAGGG CATCGGCATG CTGCGCGAGG CGGGCATCCG CATCCTGGCC
GGCGGCGACT TCGGCCACCA GTGGACAAAA CACGGCACCT ACGCGGCCGA GCTGCAGCGT
TACGTGGAGC TGGTCCACAT GTCGCCACAG GAGGCGATCA ACACGGCGAC CCGGAACATG
GGCCCGCTGG TGGGCCTGGA CGTCGGCCAG ATCCGCGCGG GCTACCTCGC CGACCTGCTG
ATCGTCGACG GCGACCCGCT CACCGACATC ACCGTGCTGC AGGACCCCGA CCGCCGTCGC
GCGGTCGTCA AGGGCGGGCG GTTCGCCTAC GTCAACCCGC GGATGTTCCC ATGA
 
Protein sequence
MTERILIRNA KILTCSAPTP RAVPGVPGGG SGAGANVGTG PDVIADGDLL IEGDRIARVR 
AGRIEVDSGS ARVIDLHGAA VLPGLGDAHV HMSWPLDFVF DHVSVANAPA APHALDVAAV
ARTFLESGYT LVVGAGVSQP FDDVRTRDAI ERGLIPGPRV IPSGTMITER GAISADTGMT
SVVSDARDLR EVVARQCDTG VRALKLFVSG DGIVPEYPSD DLYMNDEMLS AAVDEADRYG
AFITVHARGS DSVAMAARNG ARVIHHACFL DDKAVHELEA RRDDVWVCPG LHYLYAMVSG
HAEPWGVTPE KIERSGYEKE FRAQVEGIGM LREAGIRILA GGDFGHQWTK HGTYAAELQR
YVELVHMSPQ EAINTATRNM GPLVGLDVGQ IRAGYLADLL IVDGDPLTDI TVLQDPDRRR
AVVKGGRFAY VNPRMFP