Gene Franean1_0289 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0289 
Symbol 
ID5668713 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp339741 
End bp341312 
Gene Length1572 bp 
Protein Length523 aa 
Translation table11 
GC content72% 
IMG OID641239219 
Productpeptidase U62 modulator of DNA gyrase 
Protein accessionYP_001504661 
Protein GI158312153 
COG category[R] General function prediction only 
COG ID[COG0312] Predicted Zn-dependent proteases and their inactivated homologs 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGACCT CCGTTCCCCG GGATCCGGCC GAGCCGGCCC TGCCGCCGCA CGAGATCGAC 
GAGGAGTTCC GCGCGCTGCC CACCGCGGCC CTCACCGACG CCGCGCTCCA GGCCGCCCGA
GACCTGGGTG CCGCCCACGC CGACATCCGG ATCGAACGTC TCAAGGAGTC GTCGCTGTCG
TTCCGCGACG CCGGCCTGGA GAGCCGTTCG GACGGTGTCA CCGCGGGCTT CGCCGTCCGC
GTCGTCCACG ACGGGACCTG GGGCTTCGCC GCCGGTGTGG ATCTGACCGT GGATGAGGCC
GTGCGGGTCG CCCGGGAGGC CGTCGCCATG GCGAAGGTGG CGCGGCCGCT GAACTCCGAG
CCGGTCGAGC TGGCCGACGA GCCCGTTCAC GCCGGCGCGA CCTGGGTCTC CAGCTACGCC
ACGGACCCGT TCTCGGTCGA TCCGCGCGAC CAGGTCGAGC GGATCGGCGG GCTGTGCCGG
TCGCTGTACT CCGCCGAGGA CGTCGATCAC GTCGACGGAC GGTTCGCCGC CGTCATGGAG
AACAAGTTCT ACGCCGACAC CGCCGGGACG GACACCACCC AGCAGCGGGT CCGCGTGCTC
TGCCAGCTCG AGGCGACCCG GGTCGACCCG GCGGGTGGCT TCGAGTCCAT GCGCACCATC
GCGCCTCCGG CCGGCCGCGG CTGGGAGTGG ATGGTCGGCG GCTCGGCGGC CGGCTGCTGG
GACTGGGAGT CCGAGACCGA GCTCATCCCC TCGCTGCTGG CCGAGAAGGC GAAGGCGTCG
TCGGTGGAAC CGGGGCGCTA CGACCTGGTT ATCGACCCGT CGAACCTGTG GCTGACGATC
CACGAGTCGG TCGGGCACGC CACCGAGCTC GACCGGGCTC TCGGCTACGA GGCCGCCTAC
GCCGGGACGT CCTTCGCCAC GCTCGACAAG CTCGGGTCGC TGCGCTACGG ATCACCGGCG
ATGACCGTGA CCGGCGACCG GACGGCACCG CACGGCCTGG CCACCATCGG CTACGACGAC
GAGGGCGTGC AGACCCGCCG CTGGGACATC GTCCGCGACG GTGTCCTGGT CGGCTACCAG
CTCGACCGGC GGATGGCGGC GCAGAACGCG TCCACGCTCG GCGTGGACCG CTCCAACGGC
TGCGCCTTCG CGGACTCCCC CGGGCATGTA CCGATCCAGC GGATGGCGAA CGTCTCACTG
CTCCCCGCGC CCGGCGGGCC GTCCACGGAG GACCTGATCG GCCGGGTCGA CCGGGGGATC
TACGTGGTCG GCGACCGGAG CTGGTCGATC GACATGCAGC GCTACAACTT CCAGTTCACC
GGGCAGCGGT TCTACGAGAT CCGCAAGGGC CGGATCGTCG GCCAGCTGCG CGACGTCGCC
TACCAGGCGA CCACCACGGA CTTCTGGGGC TCGCTGGACG CCGTCGGCGG CCCCGAGACC
TACGTGCTGG GCGGGGCGTT CAACTGCGGC AAGGGCCAGC CCGGCCAGGT CGCGGCGGTC
AGCCACGGCT GCCCGTCGGC GCTGTTCCGC GACGTCAACA TCCTGAACAC GCGCCGCGAG
GGCGGCCGAT GA
 
Protein sequence
MATSVPRDPA EPALPPHEID EEFRALPTAA LTDAALQAAR DLGAAHADIR IERLKESSLS 
FRDAGLESRS DGVTAGFAVR VVHDGTWGFA AGVDLTVDEA VRVAREAVAM AKVARPLNSE
PVELADEPVH AGATWVSSYA TDPFSVDPRD QVERIGGLCR SLYSAEDVDH VDGRFAAVME
NKFYADTAGT DTTQQRVRVL CQLEATRVDP AGGFESMRTI APPAGRGWEW MVGGSAAGCW
DWESETELIP SLLAEKAKAS SVEPGRYDLV IDPSNLWLTI HESVGHATEL DRALGYEAAY
AGTSFATLDK LGSLRYGSPA MTVTGDRTAP HGLATIGYDD EGVQTRRWDI VRDGVLVGYQ
LDRRMAAQNA STLGVDRSNG CAFADSPGHV PIQRMANVSL LPAPGGPSTE DLIGRVDRGI
YVVGDRSWSI DMQRYNFQFT GQRFYEIRKG RIVGQLRDVA YQATTTDFWG SLDAVGGPET
YVLGGAFNCG KGQPGQVAAV SHGCPSALFR DVNILNTRRE GGR