Gene Franean1_1615 details


Gene Information

Locus tag: Franean1_1615
Symbol:
ID: 5670018
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 1932554
End bp: 1933816
Gene Length: 1263 bp
Protein Length: 420 aa
Translation table: 11
GC content: 70%
IMG OID: 641240534
Product: IS891/IS1136/IS1341 family transposase
Protein accession: YP_001505960
Protein GI: 158313452
COG category: [L] Replication, recombination and repair
COG ID: [COG0675] Transposase and inactivated derivatives
TIGRFAM ID: [TIGR01766] transposase, IS605 OrfB family, central region
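
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon. The function names are hypothetical and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Number of residues, assuming one terminal stop codon is not translated."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


length_bp = gene_length(1932554, 1933816)
print(length_bp)                  # 1263
print(protein_length(length_bp))  # 420
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.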


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00623083
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAAGACGA CGCTGCAGGC GTACCGGTTC GCGCTCGACC CGAACAACGT CGGGCGTGCC 
GGCTTGCGCC GGCACGCGGG CGCGTGCAGG TTCGCGTTCA ACTGGGGCCT GGCTCGGGTG
AAGGCCGCGC TGGCGCAGCG CGAGGCCGAG GAATCCTACG GGCTGGCCGG CGACCTGCTC
ACCGCGGTTC CGTGGACGCT GCCCGCGCTG CGCCTGGCCT GGAACACGGT GAAGAACGAC
ATCGCGCCGT GGTGGGCGGA GTGCTCGAAG GAGGCGTTCT CCGCCGGGCT GGCGCAGTTG
GCCGCCGGGT TGAAGAACTT CTCCGACTCC CGCAAGGGCA AACGGAAAGG CCGCACGGCC
GGTTTCCCCC GGTTCAAAAA GCGCGGGAAG GACCGTGACT CGTTCCGGTA CACCACCGGC
TCCTACGGTC CGGACGGGGA CCGGCATGTG AAACTGCCCC GGATCGGCCG GGTGAAGGTC
CACGAGCCGA TGGGCGCGCT CACCGCCCTG CTCGGCGACG GCCGTGCCCG CCTCCTCGGC
GCGGCCGTGT CCCGCACGGC TGGCCGCTGG TTCGTGTCGT TCACCGTGCA GGTCGAGAAG
AAGCTCCGCA GGACCTCCCG CGCCTACGCC CGGTCACAGC CGGGCAGCAG CGGCAGGCGC
AAGCTCGCCA CCGACCTGGC GAAACAGCAT GCCCACACCG CCAACCAGCG GCGCGACGGG
CTACACAAGG TCACCACCAA CCTCGCCCGG ACCCACCACA CGGTGGTCAT CGAGGATCTG
CACGTCGCCG GCATGGTGCG TGACCACAGC CTGGCGAAGG CGGTCTCCGA CGTCGGGATG
GGCGAACTGC GCCGCCAGTT GGAGTACAAG TGCGGCCGGT GGGTTCGCGA CCCGAAGACC
AGGACGCAGG TGTACGTGCC CGGCTGGCAT GGCGCGCACC TGCACGTCGC GGACCGTTGG
TACCCAAGCT CGAAGACCTG TTCCGGCTGT GGCTGGCGAA ACCCAAGCCT GACACTGTCG
GACCGCACCT TCTCCTGCCC GTCCTGCGGG CTGGTGATCG ACCGCGACGA GAACGCGGCG
GTCAACCTGG CTCGGCTCGT CGACCGCGAG TACATCGGCG ACGTTAAAAC AGCCCGTGGA
GCCGACCGTA AGACCAACGC GCCAGCACCA CCGGCGCGGC GGCGGGTGGC TGTGAAGCGG
GAACCGGGCA CGGCCAAGAC CGGTCAGACC CGGGGTGCCT CACCGAAAGG TGAAGCGGCA
TGA
 
Protein sequence
MKTTLQAYRF ALDPNNVGRA GLRRHAGACR FAFNWGLARV KAALAQREAE ESYGLAGDLL 
TAVPWTLPAL RLAWNTVKND IAPWWAECSK EAFSAGLAQL AAGLKNFSDS RKGKRKGRTA
GFPRFKKRGK DRDSFRYTTG SYGPDGDRHV KLPRIGRVKV HEPMGALTAL LGDGRARLLG
AAVSRTAGRW FVSFTVQVEK KLRRTSRAYA RSQPGSSGRR KLATDLAKQH AHTANQRRDG
LHKVTTNLAR THHTVVIEDL HVAGMVRDHS LAKAVSDVGM GELRRQLEYK CGRWVRDPKT
RTQVYVPGWH GAHLHVADRW YPSSKTCSGC GWRNPSLTLS DRTFSCPSCG LVIDRDENAA
VNLARLVDRE YIGDVKTARG ADRKTNAPAP PARRRVAVKR EPGTAKTGQT RGASPKGEAA
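
The protein sequence can be reproduced from the gene sequence by codon translation. The following is a minimal sketch using only the Python standard library. It assumes the standard amino acid assignments (which translation table 11 shares with the standard code) and treats ATG, GTG, and TTG as start codons that are read as methionine in the first position. The `translate` function and the `START_CODONS` set are illustrative names, and the start set is a simplifying assumption rather than the full table 11 definition.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G) paired with the standard amino acid string.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a reduced set of bacterial start codons, all read as Met at position 1.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_line = "GTGAAGACGA CGCTGCAGGC GTACCGGTTC GCGCTCGACC CGAACAACGT CGGGCGTGCC"
print(translate(first_line))  # MKTTLQAYRFALDPNNVGRA
```

Applied to the first 60 bases of the gene sequence, the sketch yields the first 20 residues of the protein sequence listed above. Note that the leading GTG is read as M because it is the start codon, not as valine.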