Gene Francci3_2347 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_2347 
Symbol 
ID3904554 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp2719661 
End bp2722411 
Gene Length2751 bp 
Protein Length916 aa 
Translation table11 
GC content73% 
IMG OID637879677 
Producttransposase Tn3 
Protein accessionYP_481443 
Protein GI86741043 
COG category[L] Replication, recombination and repair 
COG ID[COG4644] Transposase and inactivated derivatives, TnpA family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.669911 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.26387 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCGGTGG AGTTCCTGAC GGATGAGCAG GTTGCGGCGT ACGGGCGGTT CGATGGTGAA 
CCTGTGCGGG CTGAGTTGGA GCGGTTCTTC TTTCTGGATG ACGCCGACCG GGAGCTCGTC
GTCCGGCGCC GGGGCGAGCA CTCGCGTCTG GGGTTCGCCG TTCAGCTGGG CACGGTGCGG
TTCCTTGGGA CGTTCCTGAG CGATCCGCTG GAGGTGCCGT GGGTGGTGGT GGACTATCTC
GCGGGCCAGC TGGACGTTGC GGACTCGTCG GTGGTGAAGC GGTACACCGA GCGGCTGCCG
ACGCAGCATG AGCATGCTCG GGAGATCCGG TCGGTCTACG GCTACCGGGA CTTCGCCGAC
CCGCAGGCAG CCACGGAGCT GCGGGCGTTC CTGGCGAGCC GGGCGTGGAC GCAGGCAGAG
GGTCCGGTGG CGCTGTTCGG TCAGGCCACG GCGTGGCTGC GCCGTAGCCG GGTGCTGTTG
CCAGGGGTGA GTGTTCTGGC CCGTCTGGTG GTCTCCGTGC GCAGCGAGGC CACCGACCGG
GTGCACCAGG AGCTCGCCGT CGCGGCTGAG CAGGCCGATC CGCAGTTGGC CGGACGGCTG
CGCAGTCTCC TCGAGGTGGC GCCGGGTGCC CGGGTAAGCG AGCTGGAGCG GCTGCGGCGG
GGGCCGACGC GGACGTCGGG GCCGGGCTTG GAGCGGGCGC TGTCACGGGC CGGCGAGGTC
ATCGTGGTGG GCGCCGGCGC GGCGGATGTG TCGGGGGTGC CAGCGAACCG GCTGGCGACG
CTGGCCCGCT ACGGCCTGGC GGCCAAGGCG CCGCTGCTAC GCGAGCTCGC CGAGCCGCGG
CGTACCGCGA CCCTGCTGGC GACGGTGCGC GGTCTGGAAG CGGCGGCGGT CGACGACGCC
CTGGACCTGT TCGACGTGCT GATGGCCACC CGGCTGCTCA ACCCGGCGCG TCGCGCCGCG
CAGCAGGAAC GCCTCGCCGT CCTGCCGAAG CTGGAGAAGG CGTCGGTCAC CCTCGCCGGC
GCGGCGTCCG CGCTGCTGGG CCTGCTGGCC GGCACGGCCG GCGAGGCGCT CGACGTCGCC
GCAGCCTGGG CGGTCGTGGA GCAGGTCGCG CCGCGGGAAC GGGTGCTGGA GGCCGCCGCT
CTGGTCGAGG AGCTGGTGCC CGGCGACGCC GGCCTGGAGA CGGTGATGCG CGCCGAGCTG
GCCCGCCGTT ACGGCACGGT CCGCCCTTTT CTCCTCCTGC TCGCCGAGGC GCTGCCGCTG
GCAGCGACCG GCGACGGCCA GCCGGTCCTC GACGCGGTCC GGCTACTGCC GAACCTGGTC
GGCCGTCGCC GGATCCGCCC GGACGAGGTC GACATCGATC TGGTGCCGCC GGTGTGGCGG
CGCGCGGTGC TGGCCAACCC GACGTTGCCG GCTGGCACGG TCGACCGCGA CGGCTACGTG
CTGTGCGTGC TGGAGCAGCT GCATCGGGCG CTGCGCCGGC GCGACGTGTT CGCCGTGCGT
TCGCAGCGAT GGTCTGACCC GCGGGCCCGG CTGCTCGCCG GCGAGGGATG GGATCGGGTC
CGCGGCGAGG TGCTCGCCGG TCTCGGCCTG GCCGCCCCGG TCGAGACCCA CCTGCGTGAG
CTCGCCGGCT CGTTGGACGC CGCGTGGCGC CAGACCGTCG ACCGGCTCGC CGAGGCCGGA
CCGGACAGCC CGGTGCGGGT CGAGCCGGCG GCCGACGGGC GGATGCGCCT GGCCGTGAGC
CGGCTGGAGG CGCTCGGGGA ACCGGACAGC CTGGTCGCGC TGCGGGCGAC AACCGCGGCG
ATGCTGCCCC GTGTCGACCT GCCCGATCTG CTGCTGGAGG TGCACGCCTG GACCGGGTTC
CTCGGCGCCT ACACCCATCT GGGCGGGTCG GGCAGCCGGA TGGAGAACCT GCACATCTCC
CTCGCAGCGC TGCTGGTCGC CGAGGCATGC AACGTCGGGC TGACCCCGGT GACCAAGCCG
AGCGAGCCGG CGCTGAGCCG CGACCGGCTC TCCCACGTCG ACCAGAACTA CCTGCGCGCC
GACACCCACG CCGCGGCGAA CGCGGCTCTG ATCACCGCCC AGGCACAGAT CGGGCTCGCC
CAGGCATGGG GCGGGGGGCT GGTCGCCTCC GTCGACAGAC TGCGGTTCGT CGTCCCGGTC
CGCACGATCA ACGCAGGACC CAGCCCCCGC TACTTCGGGC TCAAACGCGG CGTGACCTGG
CTGAACGCGG TCAACGACCA GGTCGCAGGG ATCGGCGCGG TCGTCGTGCC CGGCACTGTC
CGTGACTCGC TCTACATCCT GGACACGCTG CTCACCCTCG ACGCCGGGCC GAAGCCGGAG
ATGGTCGCGA CCGACACCGC CAGCTACTCC GACATCGTCT TCGGCCTGTT CCGCCTGCTC
GGCTACCGGT TCTCCCCGCG GATCGCGGAC CTGTCCGACC AGCGACTGTG GCGTGCCACC
CTGCCCGGCG ACGCCGAGGG CGACTACGGG CCGCTGAACG CCATTGCCCG GCACCGGGTC
AACCTCGCCC GCGTCGGTGT CGTGCTGTGG AACACCCGCT ACCTCGACGC CGCCCTCACC
GCACTGCGCA CCGCAGGCCA CGACGTGCAC GACCTCGACG TCGCCCGGCT GTCCCCGCTG
GCCGACCGGC ACATCAACAT GCTCGGCCGC TACGCCTTCG CCGCCCCGCC CACCGGCACG
GGTCTGCGCC CTCTGCGTGA CCCGGGCGAG AACGATCAGG AGGAGTCATG A
 
Protein sequence
MPVEFLTDEQ VAAYGRFDGE PVRAELERFF FLDDADRELV VRRRGEHSRL GFAVQLGTVR 
FLGTFLSDPL EVPWVVVDYL AGQLDVADSS VVKRYTERLP TQHEHAREIR SVYGYRDFAD
PQAATELRAF LASRAWTQAE GPVALFGQAT AWLRRSRVLL PGVSVLARLV VSVRSEATDR
VHQELAVAAE QADPQLAGRL RSLLEVAPGA RVSELERLRR GPTRTSGPGL ERALSRAGEV
IVVGAGAADV SGVPANRLAT LARYGLAAKA PLLRELAEPR RTATLLATVR GLEAAAVDDA
LDLFDVLMAT RLLNPARRAA QQERLAVLPK LEKASVTLAG AASALLGLLA GTAGEALDVA
AAWAVVEQVA PRERVLEAAA LVEELVPGDA GLETVMRAEL ARRYGTVRPF LLLLAEALPL
AATGDGQPVL DAVRLLPNLV GRRRIRPDEV DIDLVPPVWR RAVLANPTLP AGTVDRDGYV
LCVLEQLHRA LRRRDVFAVR SQRWSDPRAR LLAGEGWDRV RGEVLAGLGL AAPVETHLRE
LAGSLDAAWR QTVDRLAEAG PDSPVRVEPA ADGRMRLAVS RLEALGEPDS LVALRATTAA
MLPRVDLPDL LLEVHAWTGF LGAYTHLGGS GSRMENLHIS LAALLVAEAC NVGLTPVTKP
SEPALSRDRL SHVDQNYLRA DTHAAANAAL ITAQAQIGLA QAWGGGLVAS VDRLRFVVPV
RTINAGPSPR YFGLKRGVTW LNAVNDQVAG IGAVVVPGTV RDSLYILDTL LTLDAGPKPE
MVATDTASYS DIVFGLFRLL GYRFSPRIAD LSDQRLWRAT LPGDAEGDYG PLNAIARHRV
NLARVGVVLW NTRYLDAALT ALRTAGHDVH DLDVARLSPL ADRHINMLGR YAFAAPPTGT
GLRPLRDPGE NDQEES