Gene Franean1_1784 details


Gene Information

Locus tag: Franean1_1784
Symbol:
ID: 5670186
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 2144215
End bp: 2145342
Gene Length: 1128 bp
Protein Length: 375 aa
Translation table: 11
GC content: 66%
IMG OID: 641240705
Product: IS605 family transposase OrfB
Protein accession: YP_001506128
Protein GI: 158313620
COG category: [L] Replication, recombination and repair
COG ID: [COG0675] Transposase and inactivated derivatives
TIGRFAM ID: [TIGR01766] transposase, IS605 OrfB family, central region
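
The length fields above agree with the coordinates if Start bp and End bp are read as 1-based, inclusive positions. The record does not state its coordinate convention, so that reading is an assumption. A minimal Python check of the arithmetic, with illustrative variable names:

```python
# Coordinates copied from the record; 1-based inclusive numbering is assumed.
start_bp = 2144215
end_bp = 2145342

# Inclusive interval length: both end points are counted.
gene_length_bp = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the final codon is the stop codon
# and does not contribute an amino acid.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp)      # 1128
print(protein_length_aa)   # 375
```

Both values match the Gene Length and Protein Length fields of the record.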


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.125878
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.304536
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAAGCGGG CCTACAGGTA CCGCTTCTAC CCGACTGTCG AGCAGGCCGA GCAGTTGGCG 
AGAACGTTCG GCTGTGTGCG CTACGTGTAC AACCGGGCGT TGGCGGAGCG GCACCGGGTC
TGGTTCCAGG AGCAGCGCCG GGTCACGCAT GCCGAGACGG ACAAGATGCT GACGGCGTGG
AAACGCGACC CGGAAACGGC GTGGCTGGCC GAGCCGTCGA AGGGGCCGTT GCAGGCGACG
CTGCGGCATC TGCAGACCGC CTACGTGAAC TTCTGGGAGA AGCGGGCCGG TTACCCGTCC
TTCAAGAAGA AGGGCAGGAC CCTCGACTCG GCGACCTACT TCCGGAACTG CTTCAGTTTC
CGCGACGGGC AGGTCAGGCT GGCGAAACAG GATCTGCCGT TGGACATCGC CTGGTCGCGT
CCGCTGCCCG AGGGTGCGGC GCCGTCCCAG GTGACGGTGT CGCGTAACAC CCGCGGCCAG
TACCACATCT CGATCCTGGT CGAGGAGACC ATCAGCAGCC TGCCTCCGTC GCCGGCACAG
GTGGGGGTCG ATGCGGGTGT CACGTCCCTG GTTGCCTTGT CGACGGGCGA GAAGGTGACC
AACCCGTGGC ACGAGCGGGC TGACCGTGCC CGGCTCGCCC GCGCGCAGCG GGAACTGTCC
CGTAAACGGA AGGGTTCGGC GAACCGGGCC AGGTCCCGGC TCACCGTGGC GCGTATCCAC
GGGCGGATCG CCGACCGGCG CCGGGATCAT CTGCACAAGC TGTCCACGAG GATCATCCGC
GAGAACCAAA CGGTGGTCAT CGAGGACCTG GCGGTCCGCA CCATGGTCCG TAACCATTCG
CTGGCACAGG CGATTTCCGA CGCTTCCTGG TCGGAGCTAC GGCGGATGTT GGAGTACAAG
GCCGACTGGT ATGGCCGCAC GGTGATCGCG GTCGACCGTT TCTACCCGAG CAGCAGGACC
TGCTCGGCCT GTGGGTCGAT CGTCGAGAAG CTGCCGTTGA ACGTACGGGA GTGGGAGTGC
CGCTGCGGCG CGCACCACGA CCGGGATGTC AACGCTGCGA AGAACATTCT GGCCGCGGGG
CTCGCGGTGT CTGCCTGTGG AGACGGAGTG AGACCACCTC GCTCCTAG
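
The gene sequence is printed in space-separated blocks of ten bases, so it has to be rejoined before any composition statistic such as the GC content field can be recomputed. The sketch below shows one way to do that in Python. The helper names `clean_sequence` and `gc_content` are hypothetical, and the record does not say how its 66% figure was rounded.

```python
def clean_sequence(blocked: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    # str.split() with no argument splits on any run of whitespace,
    # which removes both the spaces between blocks and the line breaks.
    return "".join(blocked.split()).upper()


def gc_content(blocked: str) -> float:
    """Return the fraction of G and C bases in the sequence (0.0 to 1.0)."""
    seq = clean_sequence(blocked)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# The first two blocks of the gene sequence: 12 of the 20 bases are G or C.
print(gc_content("GTGAAGCGGG CCTACAGGTA"))  # 0.6
```

Passing the full gene sequence to `gc_content` gives the value to compare against the GC content field.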
 
Protein sequence
MKRAYRYRFY PTVEQAEQLA RTFGCVRYVY NRALAERHRV WFQEQRRVTH AETDKMLTAW 
KRDPETAWLA EPSKGPLQAT LRHLQTAYVN FWEKRAGYPS FKKKGRTLDS ATYFRNCFSF
RDGQVRLAKQ DLPLDIAWSR PLPEGAAPSQ VTVSRNTRGQ YHISILVEET ISSLPPSPAQ
VGVDAGVTSL VALSTGEKVT NPWHERADRA RLARAQRELS RKRKGSANRA RSRLTVARIH
GRIADRRRDH LHKLSTRIIR ENQTVVIEDL AVRTMVRNHS LAQAISDASW SELRRMLEYK
ADWYGRTVIA VDRFYPSSRT CSACGSIVEK LPLNVREWEC RCGAHHDRDV NAAKNILAAG
LAVSACGDGV RPPRS
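
The protein sequence starts with M although the gene sequence starts with GTG, which normally encodes valine. Under translation table 11 (the bacterial code), GTG can act as an initiation codon and is then read as methionine. The Python sketch below illustrates this. The function name `translate_cds` is hypothetical, and the start codon set is limited to three common bacterial starts as a simplifying assumption, since table 11 allows more.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering. Table 11 uses the
# same amino acid assignments as the standard code; it differs in which
# codons may initiate translation.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a subset of the initiation codons permitted by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop block spacing and newlines
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiation codon is read as methionine
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence.
print(translate_cds("GTGAAGCGGG CCTACAGGTA CCGCTTCTAC"))  # MKRAYRYRFY
```

The printed fragment matches the first ten residues of the protein sequence. Applied to the final 18 bases of the gene (`AGACCACCTC GCTCCTAG`), the same function returns `RPPRS`, the last five residues, and stops at the TAG codon.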