Gene Francci3_1119 details


Gene Information

Locus tag: Francci3_1119
Symbol:
ID: 3905461
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 1332065
End bp: 1333237
Gene Length: 1173 bp
Protein Length: 390 aa
Translation table: 11
GC content: 72%
IMG OID: 637878451
Product: transposase, IS4
Protein accession: YP_480228
Protein GI: 86739828
COG category: [L] Replication, recombination and repair
COG ID: [COG5433] Transposase
TIGRFAM ID:
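
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length counts the stop codon; the function names are hypothetical and not part of the source record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length // 3 - 1


# Values taken from the record above.
start_bp, end_bp = 1332065, 1333237
length = gene_length_bp(start_bp, end_bp)
print(length, protein_length_aa(length))  # prints "1173 390"
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.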


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCGCCG CACCAGTATT ACCGCCTGCC CCTGTTCTCG ATCGGCTGGC TGCCGTCGGC 
GCCGGCAACC AGCCTCCGTC GCCGGCGGGT CTGCTGGCGG TCTTCAACCA GCTACCGGAC
CCCCGGAAGC CCCGCGGCAG GCGGCACAGC CTGGCCGCTG TGCTGACGCT GGCGACCTGC
GCGGTGCTCG CCGGGGCGCG GTCGTTCACC GCGATCGGTG AATGGTCCGC CGACGCCGGC
CAGGCGGTCG CGGGCCTGCT CGGCGTCTCT CGAGTGCCCG AGGAGTCGAC GTTTCGCCGG
GTACTCGCCG CACTCGACGC CGACGCCCTG GATACGGCGC TGGGAGCATG GGCCGCCGCG
GCGACCACCC CGCCGGCCGG GACGCGGCGG CGGCTCGCGG TCGACGGCAA GACGCTGCGC
GGCTCCCGCA CGCCTGACAG TCCGGGCCGC CACCTGCTCG CCGCGCTCGA CCACACCAGC
GGTGTCGTGC TCGGCCAGGT CGCCGTCGAC GCGAAGTCGA ACGAGATCCC GGCGCTGCCC
GTCCTGCTCG CCGACCTGGA CCTGACCGAT GTGATCGTGA CCGCGGATGC TCTACACACC
CAGCGACAGA CCGCATCCTG GCTGGTCAGC CGGCATGCGC ACTACATCCT GACGGTGAAG
GCCAACCAGC CGGCGCTATA CGCCCAGCTC GCCGCCCTGC CCTGGCGCCG GGTGAAGACC
GCCGCGCGCA CCGTCGAACG CGGCCACGGC CGCCGCGAGC GGCGCACCGT GAAGACCACC
GAGGTCCGCG CGGGACTACT CTTCCCGCAC GCCGTGCAGG CAGTGCAGGT CACCCGCCGC
CGCCAGCCGC TCGCCGACGG GCCGGCCACG ACCGAGATCG TCTACCTTGT CACCAGCCTG
CCGACCCACC AGGCCAGCCC CACGCTGCTG GCCACCTACG CCCGCGAGCA CTGGCTCGTG
GAGAACCGGC TGCACTGGGT CCGCGACGTC ACCTTCGGGG AGGACCTCAG CCAGGTCCGG
ACCGGCCACG CTCCCCAGGT CATGGCCAGC CTGCGCAACC TGGCGATCGC GATCCTTCGC
CTGACCGGCG CCACGAACAT CGCCCAGGCA ATCCGACACC ACGCGCGGCG CCCCGAACGA
CCACTAGAGA CGATCAAGAG CCTTGCTTGC TAA
 
Protein sequence
MPAAPVLPPA PVLDRLAAVG AGNQPPSPAG LLAVFNQLPD PRKPRGRRHS LAAVLTLATC 
AVLAGARSFT AIGEWSADAG QAVAGLLGVS RVPEESTFRR VLAALDADAL DTALGAWAAA
ATTPPAGTRR RLAVDGKTLR GSRTPDSPGR HLLAALDHTS GVVLGQVAVD AKSNEIPALP
VLLADLDLTD VIVTADALHT QRQTASWLVS RHAHYILTVK ANQPALYAQL AALPWRRVKT
AARTVERGHG RRERRTVKTT EVRAGLLFPH AVQAVQVTRR RQPLADGPAT TEIVYLVTSL
PTHQASPTLL ATYAREHWLV ENRLHWVRDV TFGEDLSQVR TGHAPQVMAS LRNLAIAILR
LTGATNIAQA IRHHARRPER PLETIKSLAC
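
The sequences above are printed in space-separated blocks of ten. A minimal sketch of how they could be cleaned, checked for GC content, and translated is given below. The helper names are hypothetical. The codon table is written out by hand; translation table 11 assigns the same amino acids to codons as the standard code and differs only in its permitted start codons, so the standard assignments are used here. For brevity the example runs on the last line of the gene sequence only.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments,
# which table 11 shares); "*" marks a stop codon.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(blocked: str) -> str:
    """Remove the spaces and line breaks used to print the sequence in blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(blocked: str) -> float:
    """Fraction of G and C bases in a (possibly blocked) nucleotide sequence."""
    seq = clean_sequence(blocked)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(blocked: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    seq = clean_sequence(blocked)
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# Last line of the gene sequence above (33 bases, ending in the stop codon TAA).
tail = "CCACTAGAGA CGATCAAGAG CCTTGCTTGC TAA"
print(translate(tail))  # prints "PLETIKSLAC"
```

The translated tail matches the final ten residues of the protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` would allow the GC content and Protein Length fields to be checked in the same way.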