Gene Franean1_5534 details


Gene Information

Locus tag: Franean1_5534
Symbol:
ID: 5673864
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 6703155
End bp: 6704435
Gene Length: 1281 bp
Protein Length: 426 aa
Translation table: 11
GC content: 71%
IMG OID: 641244390
Product: integrase family protein
Protein accession: YP_001509794
Protein GI: 158317286
COG category: [L] Replication, recombination and repair
COG ID: [COG4974] Site-specific recombinase XerD
TIGRFAM ID:
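
The coordinates and lengths in the Gene Information entries above agree with each other, and the arithmetic can be checked in a few lines of Python. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp 6703155, End bp 6704435.
length_bp = gene_length(6703155, 6704435)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

With the record's coordinates this reproduces the listed Gene Length (1281 bp) and Protein Length (426 aa).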


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCGCGC GCAAGGGGCG GCGGGGCGCC GGTGAGGGCG CGATCTACAA GGACAGCTCC 
GGCCGCTGGC GGGCGACCGT AGACCTCGGC TGGAAGGACG GAAAGCGGCA GCGGAAGTAC
CTCTCGGGCA AGACCCGGAC GGAGGTCGCG GAGAAGCTGC GGGCGCTGCG TCGGGAGCAG
GAAGACGGCG TGGCTGTGGT CACCGGGGTC AAGCTGCTCA CTTTGGAAGA GTGGCTCACG
TTCTGGCTGG ACACGATCGC GGCGAGGAAG GTGCGGCCGT CGACGTTGGC GACCTACCGG
GGATACGTGC GGAACCGGAT CGTCCCGGCA CTCGGCGCGG TTCGGCTCGA CCGGCTGACC
CCGGAGCATC TCGAAGCCTT CTACCACCGG TGCGAGAAGG AAGGACTGGC GGCGGCCACG
GTGCTCCAGA TGCACCGGAT CGTCTCCCGG GCGCTGAAGG TCGCGCACCG GCGGGGGCGG
GTTGCCCGGA ACGTGGCGAC GCTGGTTGAT CCTCCCTCCG TCGACCGTGA CGAGATCCGG
CCGCTCACGG CATCGGACGC GCGGGCCGTC CTCGACGCGG CGAAGGGACA GCGCAACTCG
GCCCGCTGGT CGGTCGCCCT TGCGCTCGGA CTCAGGCAGG GTGAGGCTCT GGGGCTGGCG
TGGGACGCGG TGAAGCTGGA TGCGGCCCCG GCCACGCTGA CGGTCCGGCA GGCGCTGCAA
CGGCGGCGCT GGGAGCACGG CTGTACCGAC CCGAAGACCT GCGGCACCGC GCGCCGATGC
CCTCGTCGCA CCGGCGGTGG CCTGGTGATC GTGCGCCCGA AGAGCCGCGC GGGCCGGCGG
ACGATCGTCA TCCCCGAGAA CCTGGCGGCA AGCCTGCGGG CTCACCGTTC CGCACAGCGC
GCCGAGCGCC AGGCGGCCGA TGGCGAGTGG GTCAACGAAC ACGAGCTGGT GTTCGTCCAG
CCGAACGGGC GTCCACTCGA CCCGCGGGCG GACCACCGGG CCTGGCAGGA CCTGCTGTCA
CAGGCCGGGG TGCGCGCGGC CCGGCTGCAT GATGCCCGGC ACACGATGGC AAGCCTGCTG
CTCGCGCAGA AGGTCCACCC GCGGGTGGTC ATGGAGATCA TGGGGCACTC GCAGATCAGC
CTGACCCTCG GCACCTACAG CCACGTCGCA CCGGAGCTGT CGACGGACGC GGCGGACCGG
ATGGGCTCCG CTCTATGGGG GGAGACAACC CCGGCCACGG ACGGTGAAGA AGGCCGAAAC
CAGGAAAACG AGAAAGGCTG A
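
The GC content reported for this gene can be recomputed from a sequence in the space-separated layout used above. The sketch below is one straightforward way to do it, assuming GC content means the fraction of G and C among the unambiguous bases A, C, G and T. `gc_content` is a hypothetical helper name.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C among A/C/G/T characters; whitespace is ignored."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A, C, G or T characters")
    gc = sum(1 for b in bases if b in "GC")
    return gc / len(bases)


# First line of the gene sequence, used only to illustrate the call.
first_line = (
    "ATGGCCGCGC GCAAGGGGCG GCGGGGCGCC GGTGAGGGCG CGATCTACAA GGACAGCTCC"
)
print(f"{gc_content(first_line):.0%}")
```

A single line is only a fragment, so its value will differ from the figure for the whole gene; passing the full 1281 bp sequence gives the gene-level value.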
 
Protein sequence
MAARKGRRGA GEGAIYKDSS GRWRATVDLG WKDGKRQRKY LSGKTRTEVA EKLRALRREQ 
EDGVAVVTGV KLLTLEEWLT FWLDTIAARK VRPSTLATYR GYVRNRIVPA LGAVRLDRLT
PEHLEAFYHR CEKEGLAAAT VLQMHRIVSR ALKVAHRRGR VARNVATLVD PPSVDRDEIR
PLTASDARAV LDAAKGQRNS ARWSVALALG LRQGEALGLA WDAVKLDAAP ATLTVRQALQ
RRRWEHGCTD PKTCGTARRC PRRTGGGLVI VRPKSRAGRR TIVIPENLAA SLRAHRSAQR
AERQAADGEW VNEHELVFVQ PNGRPLDPRA DHRAWQDLLS QAGVRAARLH DARHTMASLL
LAQKVHPRVV MEIMGHSQIS LTLGTYSHVA PELSTDAADR MGSALWGETT PATDGEEGRN
QENEKG
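
The protein sequence above can be reproduced from the gene sequence by codon-wise translation. The sketch below is a minimal illustration that uses only the standard library. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code, and it does not handle the alternative start codons that table 11 permits (not needed here, since this gene starts with ATG). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons, ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(seq: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence from this record.
print(translate("ATGGCCGCGC GCAAGGGGCG GCGGGGCGCC GGTGAGGGCG CGATCTACAA GGACAGCTCC"))
print(translate("CAGGAAAACG AGAAAGGCTG A"))
```

The two calls correspond to the first 20 residues (MAARKGRRGA GEGAIYKDSS) and the final 6 residues (QENEKG) of the protein sequence listed above.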