Gene Francci3_1967 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1967 
Symbol 
ID3903675 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp2308839 
End bp2310002 
Gene Length1164 bp 
Protein Length387 aa 
Translation table11 
GC content71% 
IMG OID637879304 
ProductISRSO5-transposase protein 
Protein accessionYP_481071 
Protein GI86740671 
COG category[L] Replication, recombination and repair 
COG ID[COG3415] Transposase and inactivated derivatives 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCTTGGG GGCGTGGCTG GCCCGGTAGG ACCGGGAGGG TGTCCCGCCG TGCCACCGCG 
ATCACCCTGA CCTGCGACGT CCGTGCCGTT CTGGCGGGCC GTGCCCGTTC GTTGACGGTG
CCGCGCCGTG ACTGGCTGCG CGCGGCGATC GTGCTCGCCG CCGCCGACGG TGCGAGCAAC
ACGACGATCG CCACCGATCT CGGTGTCTGC GAGGACACCG TGGGCAAGTG GCGGGGCCGG
TTCGCCCGTG AAGGGCTCGC CGGGCTGGTG GACCGGCCCC GTTCGGGTCG GCCGGCCCGG
TTCACCGCGG TGCAGGTCGC CGAGGTCAAG GCGTTGGCGT GCACCCGGCC CGCGGACGTC
GGTGCGCCGC TGGAACGCTG GTCGAACGCC GAACTCGCCC GCCACGCGGC CCGGGAGGGC
ATCGTGGCGG GCGTGTCAGC GTCGTCGGTC GGACGCTGGC TGGCCCGGGA TGCGATCCGC
CCCTGGCAGC ACCGCTCGTG GATCTTCCCC AGGGATCCGG CGTTCGTCGC GAAGGCCTCC
CGGGTCCTCG ATCTCTACGC TCGGATCTGG GACGGTGCAC CACTCGGCGA GAACGACTAT
GTGATCTCAG CCGATGAGAA GTCCCAACTG CAGGCACTGC GCCGCTGCCA CCCGACCGCG
CCCGCCCAGG CCCGTCACGG ACCGCGCGTC GAGTTCGAGT ACCAACGCGG CGGCACCCTG
GCCTACTTCG CCGCCTACGA CGTCCACCAT GCCCACGTCA TCGGCCGGAT CGAGCCGACC
ACCGGCATCG CCCCGTTTCA CCGGCTCGTC GACCAGGTCA TGACGGCCGA GCCCTACGCC
TCGGCACGCA GGGTGTTCTG GGTCGTCGAC AACGGCTCCT CCCACAACGG CCAGACCTCG
ATCCGCCGGA TGAGCGCGGC CCATCCGACC GCGACGCTCG TCCACCTCCC GGTCCACGCC
TCCTGGTTGA ACCAGGTGGA GATCTACTTC TCGATCCTGC AGCGCAAGGC CATCGAACGC
GGGGACTTCG CCGACCTCGA CGCACTCGGC GACCGGGTCA TGGGCTTCCA GGACCTCTAC
AACCAGACCG CGGCCCCGTT CGACTGGAAG TACACCCGCG CCGATCTCCT CAAGACCGCC
AGCGGCCTCG CCCTCGCCGC ATAA
 
Protein sequence
MPWGRGWPGR TGRVSRRATA ITLTCDVRAV LAGRARSLTV PRRDWLRAAI VLAAADGASN 
TTIATDLGVC EDTVGKWRGR FAREGLAGLV DRPRSGRPAR FTAVQVAEVK ALACTRPADV
GAPLERWSNA ELARHAAREG IVAGVSASSV GRWLARDAIR PWQHRSWIFP RDPAFVAKAS
RVLDLYARIW DGAPLGENDY VISADEKSQL QALRRCHPTA PAQARHGPRV EFEYQRGGTL
AYFAAYDVHH AHVIGRIEPT TGIAPFHRLV DQVMTAEPYA SARRVFWVVD NGSSHNGQTS
IRRMSAAHPT ATLVHLPVHA SWLNQVEIYF SILQRKAIER GDFADLDALG DRVMGFQDLY
NQTAAPFDWK YTRADLLKTA SGLALAA