Gene Franean1_5323 details


Gene Information

Locus tag: Franean1_5323
Symbol:
ID: 5675759
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 6413690
End bp: 6414958
Gene Length: 1269 bp
Protein Length: 422 aa
Translation table: 11
GC content: 70%
IMG OID: 641244181
Product: integrase family protein
Protein accession: YP_001509587
Protein GI: 158317079
COG category: [L] Replication, recombination and repair
COG ID: [COG4974] Site-specific recombinase XerD
TIGRFAM ID:
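
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 6413690
end_bp = 6414958

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # 1269 422
```

Under these assumptions the result agrees with the listed gene length of 1269 bp and protein length of 422 aa.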


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0388032
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCAGCA ACACCAACGG CCCGGCGAAC GGCCGGCGCA AGGAACGCAC CAAGCGCGCG 
AACGGTCAGG GCTCGATCTA CCAGCGCGGG GACGGCCTGT GGGCCGGCGC GGCCTACGTC
CTCATGCCGG ACGGCACGAC GAAGCGGCGG CCGGTCTACG GCAAGTCGGA GGAGATCGTC
CGCGGCAAGC TGACCGAGCT TCAGGCCAAC TCGGATCAGG GCATTCCGGC GGACGCTACC
GGGTGGACGG TCGAGCGGTA CCTGACGGCC TGGCTCGCTC AGACCGTGAA GCCGAACCGT
CAGCCGAACA CGTACGTGAC CTATGAGAAG GCGGTCCGGA TCTACCTCGT GCCGGGTCTC
GGGAAGAAGC GGCTGAACAG GCTGACCGGT GCGGACGTCC GGCAGTTCAT CCGGCGGACG
GAGAGCACCT GCCGGTGCTG CGCCCGGGGA GTGGACAAGG CACGACCGGA GGAGGAGCGG
CGTTGCTGCG CGGTCGGCCG GTGCTGCAAG CGGTTGCCGT CGAAGCGTCA GGTTCAGGTC
GTGCATGCGG TGCTGCGTAA CGCTCTCCAG GCGGCGGTCC GGGAGGAGCT GATCCGGCGG
AACGTCGCGA AGCTGGTGCA GGTCTCCACC CCGCGCTACG GCGTCGACCG CGGCCTGACC
GTCGAGCAGG CGCATCAGCT CCTCGACGCG GCGGCGGGCG ACCGGCTGTA CGCGCTGCTG
GTCCTGGCGC TGTTCCTCGG GATGCGGCGC GGGGAGTTGC TCGGTCTGCA GTGGTCGGAC
ATCGACTCTG AGCGGGAGAC GCTGACGGTG CGTCACACGC TGCTGCGGGT CGGCGGGGAG
CTGCGGCTGT GCCCGCCGAA GACGGAGGAC TCCGAACGGA CGCTGCCGCT CCTCGGCCTG
GTCGCGGATG CGCTGGCGGA GCATCGCAAG CTCCAGGACG CTGAACGTGC GGCGGCCGGC
GACGCCTGGG TGCAGACCGG CCACGTGTTC ACGACGAAGA TCGGGACTCC GATCGAACCG
GACAACCTGC GCCGGTTCTG GCTGCCGCTG CGCCGTGCGG TCGGGCTCGA CGGGGTGGTG
TTTCACGGGC TGCGGCACAC CTGCGTGACG CTGCTCCTGG ACCTCGGTGT GCCGCCGCAC
ATCGTCCGGG ACATCGCGGG CCACTCGGCG ATCGAGGTGA CGATGACGAT CTACGCGCAT
GCCTCGATGG GGGAGAAGCG GCGCGCCCTG GGCAAGCTCG ACGGCCACCT GACCGAGCCC
CGGGACTGA
 
Protein sequence
MPSNTNGPAN GRRKERTKRA NGQGSIYQRG DGLWAGAAYV LMPDGTTKRR PVYGKSEEIV 
RGKLTELQAN SDQGIPADAT GWTVERYLTA WLAQTVKPNR QPNTYVTYEK AVRIYLVPGL
GKKRLNRLTG ADVRQFIRRT ESTCRCCARG VDKARPEEER RCCAVGRCCK RLPSKRQVQV
VHAVLRNALQ AAVREELIRR NVAKLVQVST PRYGVDRGLT VEQAHQLLDA AAGDRLYALL
VLALFLGMRR GELLGLQWSD IDSERETLTV RHTLLRVGGE LRLCPPKTED SERTLPLLGL
VADALAEHRK LQDAERAAAG DAWVQTGHVF TTKIGTPIEP DNLRRFWLPL RRAVGLDGVV
FHGLRHTCVT LLLDLGVPPH IVRDIAGHSA IEVTMTIYAH ASMGEKRRAL GKLDGHLTEP
RD
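
Both sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any downstream use. The following is a minimal sketch of that clean-up plus a simple GC fraction calculation. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the demo input is only the first two blocks of the gene sequence, not the full record.

```python
def clean_sequence(block: str) -> str:
    """Join a space/newline-grouped sequence block into one uppercase string."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide string."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# Demo on the first two ten-base blocks of the gene sequence only.
first_blocks = "ATGCCCAGCA ACACCAACGG"
seq = clean_sequence(first_blocks)
print(len(seq), gc_fraction(seq))
```

Pasting the complete gene sequence into `clean_sequence` and passing the result to `gc_fraction` would give a value to compare against the listed GC content.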