Gene Franean1_1200 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1200 
Symbol 
ID5669613 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp1433222 
End bp1434907 
Gene Length1686 bp 
Protein Length561 aa 
Translation table11 
GC content71% 
IMG OID641240132 
Productbeta-lactamase domain-containing protein 
Protein accessionYP_001505560 
Protein GI158313052 
COG category[R] General function prediction only 
COG ID[COG0595] Predicted hydrolase of the metallo-beta-lactamase superfamily 
TIGRFAM ID[TIGR00649] conserved hypothetical protein 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.582889 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.255924 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGCATC CGCACCCGGA GCTGGGCCCC CCACCGCCGC TGCGTGCCGA CGGCCTGAGG 
ATCATCCCGC TCGGCGGCCT CGGTGAGATC GGTCGCAACA TGACCGTGTT CGAGCATGCC
GGGCGGCTGC TCATCGTCGA CTGCGGGGTG CTCTTCCCCG AGACCGACCA GCCCGGCGTC
GACCTCATCC TGCCGGACTT CACCGCAATC CGGGACCGCC TGCAGGACAT CGAGGCGGTC
ATCCTCACAC ACGCGCACGA GGACCACATC GGCGCCGTCC CGTACCTGCT GCGCGAACGG
CGTGACATCC CCCTGGTCGG CACCCGGCTG ACGCTCGCGC TCATGGTGGC CAAGCTCGCC
GAACACCGCA TCCAGCCGGT GACCCTGCAG ATCCGCGAGG AGGAGAGGCA CTCGTTCGGT
CCCTTCGACC TCGAGTTCCT CGCCGTCAAC CACTCCATCC CAGACGCGGT CGCGGTCGCG
ATCCGCACCG ACGCGGGCCT GGTGCTGCAC ACCGGCGACT TCAAGATGGA CCAGCTCCCG
CTGGACGGGC GGCTCACCGA CCTGGGCGGC TTCGCCCGGC TCGGCCGCGA GGGCGTCGAC
CTGCTGCTCT CGGACTCGAC CAACGCCGAG GTCCCCGGCT TCGTCGCCTC GGAGCGCGCG
ATCGCCCCCG TGCTCGACAA GGTCTTCCGC GAGGCGGACA GGCGCATCGT CGTCGCGTGT
TTCGCCAGCC ACGTCCACCG CGTGCAGCAG GTGCTCGACG CCGCCGAGTC GCACGGCCGG
TCGGTCTGCT TCATCGGCCG GTCGATGGTC CGCAACATGG GTGTCGCCCG CGATCTCGGC
CTGCTGCGCG TGCCGCCCGG CCTGGTGATC GACAGCCGGG ACGTCGACTC GCTTCCCGAC
CGCAACATCT GCCTGGTATC GACCGGGTCG CAGGGCGAGC CGCTGTCCGC GCTGTCGCGC
ATGGCTAACC GGGACCACGC GATCCGGATC CAGGAGGGCG ACACGGTCGT CCTGGCCTCC
AGCCTGATCC CGGGCAACGA GACGGCCGTG TTCCGCGTGA TCAACGGTCT GACCAGGTGG
GGCGCCCGGG TCGTGCACAA GGGCGTGGCG ATGGTCCACA CCTCCGGGCA CGCCCCGGCC
GGTGAGCTGC TCTACGTCCT CAACGCGACC AAGCCGTCGA ACATGATGCC CGTCCACGGC
GAGTGGCGGC ACCTGCGTGC GCACGGCGCG CTCGCGGAGG CCACCGGTGT CCCGCCGGAC
CGGGTCATCA TCGCCGAGGA CGGCATGGTC GTCGACCTCA TCGACGGCCA GGCGGAGATC
ACCGGAGCGG TGCCCTGCGG GATGGTCTTC GTCGACGGGC TCGCCGTCGG CGACGTGGGG
GAGTCGAGCC TGAAGGACCG GCGGATCCTC GGCGAGGAGG GCTTCATCAC GATCACGGTC
GTGGTGGACG CCGCCGCCGG CAAGGTCGTC GTCGGCCCGG ATCTCTCCGC CCGCGGGTTC
TCCGACTCCC GGGCCGCGTT CGAGGAGGTC CGCGGCAAGC TCGCGGACGC CCTCGCCGAC
GCGATGCGCT CCGGCATGAC CGACACCAAC GCGCTGCAGC AGCTCGTCCG GCGCACGGTG
GGCCGTTGGG TCAACGACCG CTACCGCCGC CGCCCGATGA TCCTCCCGGT CGTCCTGGAG
GTCTGA
 
Protein sequence
MMHPHPELGP PPPLRADGLR IIPLGGLGEI GRNMTVFEHA GRLLIVDCGV LFPETDQPGV 
DLILPDFTAI RDRLQDIEAV ILTHAHEDHI GAVPYLLRER RDIPLVGTRL TLALMVAKLA
EHRIQPVTLQ IREEERHSFG PFDLEFLAVN HSIPDAVAVA IRTDAGLVLH TGDFKMDQLP
LDGRLTDLGG FARLGREGVD LLLSDSTNAE VPGFVASERA IAPVLDKVFR EADRRIVVAC
FASHVHRVQQ VLDAAESHGR SVCFIGRSMV RNMGVARDLG LLRVPPGLVI DSRDVDSLPD
RNICLVSTGS QGEPLSALSR MANRDHAIRI QEGDTVVLAS SLIPGNETAV FRVINGLTRW
GARVVHKGVA MVHTSGHAPA GELLYVLNAT KPSNMMPVHG EWRHLRAHGA LAEATGVPPD
RVIIAEDGMV VDLIDGQAEI TGAVPCGMVF VDGLAVGDVG ESSLKDRRIL GEEGFITITV
VVDAAAGKVV VGPDLSARGF SDSRAAFEEV RGKLADALAD AMRSGMTDTN ALQQLVRRTV
GRWVNDRYRR RPMILPVVLE V