Gene Francci3_4227 details

Gene Information

Locus tag: Francci3_4227
Symbol:
ID: 3907193
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 5041868
End bp: 5043166
Gene Length: 1299 bp
Protein Length: 432 aa
Translation table: 11
GC content: 71%
IMG OID: 637881553
Product: transposase, IS4
Protein accession: YP_483302
Protein GI: 86742902
COG category: [L] Replication, recombination and repair
COG ID: [COG5659] FOG: Transposase
TIGRFAM ID:
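
The start, end, and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the record above.
start_bp = 5041868
end_bp = 5043166

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)     # 1299
print(protein_length)  # 432
```

Under these assumptions the computed values agree with the reported gene length (1299 bp) and protein length (432 aa).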


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGTTGTAC TGACTGTCGT CTTCATGCGT GTGGCGGTGG TTGGTATGGA ACTGGGTGAG 
ATGGGCCGGG TACGTCCGGT GATCGAGCGG TTTGCGGGTG AGATGTTCGC GGACCTGCCC
CGGCGGGACC AGCGGGGTAA GGGTGAGCTG TACGTGCAAC GGCTGCTGAC CGACGGCAAG
CGCAAGAGCA TGGTCCCGAT GGCCGCCCGC CTCGGTGTGG ACCCTCAGCA GCTGCAGCAG
TTCGTCACCA GCTCGACCTG GGACTACCGC CAGGTCCGGC GGCGGCTGAC GGGCTGGGCG
GCCGGGTTCT GCGACCCGGT CGCGCTGGTC GTGGACGACA CCGGCTTTCC CGAAGGACGG
GCCGCCTCCC CCGGGGTGGC CCGGATGTAC TCCGGCACCC TGGGCAAGGT CGGGAACTGT
CAGATCGGGG TGTCCGTGCA CGCGGTCACC GACTGGGCGT CGGCCGCGGT GGCCTGGCGG
CTGTTCCTCC CGACCTGCTG GGACGACACC ACCCTGACCG ACCCGACCGA GGTCGCCGCC
GCCCGGGCCC GGGAACGGGC CGCGATCCCC GACAAGGCGC GGCACCGGGA GAAATGGCGG
CTCGTCCTGG ACATGATCGA CGAACTGGCC GGCTGGGGCA TGCCTGTGCG GCCGGTCGTG
GCGGACGCCG GCTACGGCGA CGCCGCTGCC TTCCGCCAGG GGCTGACCGA CCGGAACATC
CCCTACGTGC TGGCGGTGAA GCCGACCGCG ACCGCCTACC CGGCCGACGC GGTGCCGGTC
ACCGCCCCCT ACCCGGGAAA CAGCCGGCGG CCCACACCCG CCTACCCCGA CCCGCCCCGC
GATCTGAAAT CCCTGGTCAT GGCCGCCGGC CGCCGCACGG GCCGGTCTGT GACCTGGCGT
CACGGCACCC ACCGGACCCC GGCCAACCCG ACCGCGGGGA TGCGGTCCCG CTTCCTCGCG
CTCCGGGTCC GCCCCGCGGG CCGGAACATC ACCCGTAACC CCGACCGGAG CCTGCCCGTC
TGCTGGCTGC TCGCCGAATG GCCAGTCGGC CAGCCCGAAC CCACCGATTA CTGGCTGTCC
ACCCTGCCCA CCGGCATCCC CCTGCGCGAT CTTGTTCGTC TCGCGAAGAT CCGCTGGCGG
ATCGAACACG ACTACCGCGA GTTGAAAGAC GGCCTCGGCC TCGACCACTT CGAGGGCCGA
ACCTTCGCCG GCTGGCACCG TCACGTCACC CTCGTCAGGG TCGCCCAAGC CCTCTGCACC
CAGCTGAGAC GAACCCCAAA AGTCCCTGCG CCGGCCTGA
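
A GC content figure such as the one reported for this gene is the fraction of G and C bases in the sequence. The function below is a minimal sketch of that calculation; the name `gc_content` is hypothetical, and the example input is only the first two 10-base blocks of the gene sequence, not the full gene.

```python
def gc_content(seq: str) -> float:
    """Return the G+C fraction of a nucleotide sequence, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# First two 10-base blocks of the gene sequence above (10 of 20 bases are G or C).
print(gc_content("GTGGTTGTAC TGACTGTCGT"))  # 0.5
```

Passing the complete gene sequence to the same function would give the whole-gene value for comparison with the reported GC content.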
 
Protein sequence
MVVLTVVFMR VAVVGMELGE MGRVRPVIER FAGEMFADLP RRDQRGKGEL YVQRLLTDGK 
RKSMVPMAAR LGVDPQQLQQ FVTSSTWDYR QVRRRLTGWA AGFCDPVALV VDDTGFPEGR
AASPGVARMY SGTLGKVGNC QIGVSVHAVT DWASAAVAWR LFLPTCWDDT TLTDPTEVAA
ARARERAAIP DKARHREKWR LVLDMIDELA GWGMPVRPVV ADAGYGDAAA FRQGLTDRNI
PYVLAVKPTA TAYPADAVPV TAPYPGNSRR PTPAYPDPPR DLKSLVMAAG RRTGRSVTWR
HGTHRTPANP TAGMRSRFLA LRVRPAGRNI TRNPDRSLPV CWLLAEWPVG QPEPTDYWLS
TLPTGIPLRD LVRLAKIRWR IEHDYRELKD GLGLDHFEGR TFAGWHRHVT LVRVAQALCT
QLRRTPKVPA PA
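
The protein sequence corresponds to the gene sequence read in codons under translation table 11. The sketch below shows that mapping for the first 30 bases. It assumes the codon-to-amino-acid layout of the standard NCBI table in TCAG order, which table 11 shares, and it treats an initial GTG or TTG as methionine in the same way as ATG. The `START_CODONS` set is a deliberately incomplete assumption, and the `translate` function is hypothetical rather than taken from any library.

```python
from itertools import product

# Amino acids for the 64 codons, listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: only these start codons are handled; table 11 permits others too.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            # An initiation codon is read as methionine regardless of its usual meaning.
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
print(translate("GTGGTTGTAC TGACTGTCGT CTTCATGCGT"))  # MVVLTVVFMR
```

The output matches the first ten residues of the protein sequence, including the leading M that comes from the GTG start codon.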