Gene Franean1_1457 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1457 
Symbol 
ID5669861 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp1753223 
End bp1754365 
Gene Length1143 bp 
Protein Length380 aa 
Translation table11 
GC content68% 
IMG OID641240377 
Productintegrase family protein 
Protein accessionYP_001505803 
Protein GI158313295 
COG category[L] Replication, recombination and repair 
COG ID[COG4973] Site-specific recombinase XerC 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.417277 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCCAGGC CCTCACTGAA CCTCGGGACC TACGGCAACC TGTACACGGC GAAGTACGGC 
GACGGCTACC GGGCACGCGC CCGGTACCGC GACTTCGACG GCCGCACGCG GCTCGTCGAA
CGCCACGCCA AGACCAAGGG CGCCGCCGAG CAAGCCCTAC GCACGGCCCT GCGGGACCGC
GCCCGTGTCG ACGTCGGCAC CGGCGCCATC ACCGCCGAAG CAAAGGTCGC CGTTCTTGCC
GAGGCCTGGT ACGAGTCCGT CCAGCGGCAG GACCGCTCCC CCAACACCAC CGCCGCCTAC
CGGACCCGAC TCGACAAGTC CGTGATCCCT GGCCTCGGCG AGCTACGTAT CCGAGAGCTG
ACGGTCGGCG TCGTCGACCG CTTCCTGTCC ATCATCGCCG AGAAGCATGG ACCGGCCGCC
GCCAAGCAGA CCCGCGCCGT CCTCTCCGGC ATGTGCGGTC TCGCGGCCCG CCACGACGCC
CTGGACCGCA ACGTCGTACG CGACGCCGGA CCGATCGCCG AGGCCACCTC CAAGGATCAG
CCGCGGGCCC TCACCCTCGA CCAGCTCCGA GAGCTGCGCG TCGCGCTCCG GAACGACCCG
AAGGCCGTCG GGCGCGACAT CCCGGAGTTC GTCGACCTAC TCATGAGCAC CGGCGTCCGC
ATCGGCGAAG CCGCCGGGCT GACGTGGGGT GCAGTGAACC TCGACGAGGG CTGGATCGAG
ATCCGCTCGA CCGTCGTACG GATCAAGGGC CAAGGTTTGT TCAACAAGCC GAAGCCCAAG
ACCAAGGCCG GCCACCGCCG CCTGCAACTC CCATCCTGGA TGATCCGGAC CCTGAAGCAA
CGCTTCGACA ACCAGCCGGA CGACGTGACG GTCTTCCCCG CTCAGCTCGG CGGACTACGC
GACCCGTCAA ACACTCAGGC CGACCTACGC GACGCATTCA AGGCCGTCGG CATGGAGTGG
GCAACCTCCC ACATCGTCGG CCGCAAGTCC GTCGCCTCAG CAATGGATAG CGCTGGCCTT
ACAGCACGCG CCGCCGCCGA CCAGCTCGGA CACCGCCAAG TGAGCCTCAC TCAAGACCGC
TACTTCGGCC GCAAAGTCGC CGAGACTGGG GCAGCCGCGA TACTTGAAGA ACTGGATGTT
TGA
 
Protein sequence
MARPSLNLGT YGNLYTAKYG DGYRARARYR DFDGRTRLVE RHAKTKGAAE QALRTALRDR 
ARVDVGTGAI TAEAKVAVLA EAWYESVQRQ DRSPNTTAAY RTRLDKSVIP GLGELRIREL
TVGVVDRFLS IIAEKHGPAA AKQTRAVLSG MCGLAARHDA LDRNVVRDAG PIAEATSKDQ
PRALTLDQLR ELRVALRNDP KAVGRDIPEF VDLLMSTGVR IGEAAGLTWG AVNLDEGWIE
IRSTVVRIKG QGLFNKPKPK TKAGHRRLQL PSWMIRTLKQ RFDNQPDDVT VFPAQLGGLR
DPSNTQADLR DAFKAVGMEW ATSHIVGRKS VASAMDSAGL TARAAADQLG HRQVSLTQDR
YFGRKVAETG AAAILEELDV