Gene Francci3_3267 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3267 
Symbol 
ID3904438 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp3869452 
End bp3870522 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content72% 
IMG OID637880592 
Producttransposase, IS605 OrfB 
Protein accessionYP_482353 
Protein GI86741953 
COG category[L] Replication, recombination and repair 
COG ID[COG0675] Transposase and inactivated derivatives 
TIGRFAM ID[TIGR01766] transposase, IS605 OrfB family, central region 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.244566 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCTCGAAG GAGGCGTTCC GCGCTGGCCT GGATCAGCTT GCCCGGGGGT TGAAGAACTT 
CACCGACTCC CGGTAAGGGA AGCGGAAGGG CCGGCGGGTC GGTTTCCCCG GTTCAAGAAG
CGCGGGAGGG CCCGTGACTC GTTCCGGTAC ACCACCGGCG CCTACGGTCC GGCTGGGGAT
CGGCAGGTGA AGCTGCCCCG GGTCGGCCGG GTCAAGGTGC ATGAGCCGAT GGGTGCGCTG
ACCGGCCGGC TCGTCGACGG GTCCGCACGC CTGCTGGGCG CGACCGTGTC CCGGACCGCG
GGGCGCTGGT TCGTGGCGTT CACCGTGGAG GCCGACCGTG ATGTCCCCGG CAAGCCGTCT
ACCCGGCAGC GCCGGGGCGG CCCGGTCGGG GTCGACCTGG GTGTGAGGCA CCTCGCCGTC
CTGTCGACCG GCGAGACCGT CGAGAACCCC AGGCCGCTGG CCCGGTCGCT GCGCGAACTG
CGCCGCGCGT CGCGGGCCTA CGCCCGCTCC ACGCCCGGCA GCGCCGGTCG CCGTAGACAC
GCCGCCACCC TCGGTCGCCT CCATGCCCGC GTCGCCTACC AGCGCCGCGA CGGGCTGCAC
AAGCTCACCG CCCGGCTGGC CAAGACCCAC GACACGATCG TGGTGGAAGA CCTGCACGTG
GCCGGGATGG TCCGTAACCG CCGTCTTGCC CGTGCCGTGT CCGATACCGG GATGGCCGAG
GTCCGCCGCC AGCTCGCCTA CAAGACCCTC TGGTACGGAT CGACGCTCGT GGTCGCGGAC
CGCTGGTATC CAAGCTCGAA GACCTGTTCC GACTGCGGCT GGCGAAACCC AGGCCTCACC
CTGTCGGAGC GTATCTTCGC CTGCCAGTCC TGCGGGCTGG TAGGCGGCCG CGACCTGAAC
GCCGCGGTCA ACCTGTCCAA CCTCGTCGCC GCCAGTAGGT CGGAGACGGT AAACGCCCGT
GGAGCCGACC GTAGGACCCC GTCCGCGGGG CGGGCGGCTG GGAAACGGGA ACCCGGCACG
GCTCGGGCCG GTCGGACCGG GAGTGCTTCA CCGCAAGGTG AGGCGGCATG A
 
Protein sequence
MLEGGVPRWP GSACPGVEEL HRLPVREAEG PAGRFPRFKK RGRARDSFRY TTGAYGPAGD 
RQVKLPRVGR VKVHEPMGAL TGRLVDGSAR LLGATVSRTA GRWFVAFTVE ADRDVPGKPS
TRQRRGGPVG VDLGVRHLAV LSTGETVENP RPLARSLREL RRASRAYARS TPGSAGRRRH
AATLGRLHAR VAYQRRDGLH KLTARLAKTH DTIVVEDLHV AGMVRNRRLA RAVSDTGMAE
VRRQLAYKTL WYGSTLVVAD RWYPSSKTCS DCGWRNPGLT LSERIFACQS CGLVGGRDLN
AAVNLSNLVA ASRSETVNAR GADRRTPSAG RAAGKREPGT ARAGRTGSAS PQGEAA