Gene Franean1_4775 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4775 
Symbol 
ID5673116 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5700823 
End bp5703339 
Gene Length2517 bp 
Protein Length838 aa 
Translation table11 
GC content71% 
IMG OID641243631 
Producttransposase Tn3 family protein 
Protein accessionYP_001509047 
Protein GI158316539 
COG category[L] Replication, recombination and repair 
COG ID[COG4644] Transposase and inactivated derivatives, TnpA family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.855357 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGATCG AGTACCTCTC CGAGGAGCAG GTCCGCCGGT ACGGGGGGTT CGCCCGGGAT 
CCCTCACCGG GGGAGTTGGA GCAGTCCCTG CGGATGGACC GCGCCACGCT GGCGCTGGTC
GCGTCGAAGC GGCGGGAGGC CAACCGGCTG GGCTGGTCGG TGCAGTGGGC GACGGTGCGG
ATGGTGGGGA CGTTCCTGAC GGATCCTGCT GAGGTGCCCG CGGTGGTCGC GGCGTTCATG
GCCGAGCAGG TCGGGGTGGT GGACCCGGGC TGTCTGAAGG GCTACTCGAC GTTGGAGGAG
ATGCGCACGA CAGTGACCGC GGTGTCCGGG CGGGGCCTGG TCGGGGCGTT GGACCGGGTG
TCGTCGGTGT GGGCGGTGGG GACCGGCGGG GTGGAGGTGG CGGCGGTGCC GCCGGTGAAA
CTCGCCGAGC TCGCCGCCTA CGGGATGGTC ACGAAGGCCA CGACGATCCG GGGCCTGCAC
GATGATCGGA AGGTCGCGAC CGTGCTGGCC ACGGTGCGTC ATCTGGAAGC CGTCTCGGTC
GACGACGCGC TGCTGCTGTT CGACATCCTG ATGGCGACGA AGCTGCTGGC CCGCGCCGAA
CGGGTCAGCG GCACCAAGCG GCTCAAGACG CTGCCCCGCT TCCGTCAGGC CGCCGGACGG
GTCGCGGCCG CAATCACTGT GCTGCTCGAC GTGCCGCAGG CTCGGGACGG GCAGGTGATG
ACCGTGGCGG AGATGTGGAC CGCGATCGAG CAGGTCGTGC CGCGGGAGAA GTTGCAGGCG
GCGCTGGTGA CGGTGGCGGA GTATCTGCCC GACGAGGCCG AGGACGACGA CGCGGACTGG
CGCACCGAGC TGGTCACCCG CTACGCCACG GTGACGGGGT TCCTGGAGCT GCTGGCCGAG
ACGATCGCCT GGGGTGCGAC GCCGGCCGGC GCACCGATCG TGGCGGCGCT GCGGGACCTT
CCGCGGGTCA AGGCACGGCG CGCCCCGGAG GCGGCGCACA TCGGGGAGCA CGCCGGGCTG
GTGACCGGCT CGTGGCGGCG GCTGGTGTAC GCCAACCCGC AGCTGTCCGC ACCGCTGATC
GACAAGCATG CCTACGTGTT CTGCGTGCTC GAACATCTGC ACCGGGCGCT GCGCCGCAAG
GACGTCTACG CCCTGGGCGC GGACCGCTGG GGTGATCCCC GCGCCCGTCT CCTCGACGGC
GACGCCTGGG AGCAGGCCCG CCCCCGGGGG CTGTCCGCTC TCGGACTGCC GGAGCAGCCC
ACCGACCATC TCGCCGAACT CGTCTGTGAC CTCGACGCCG CCTACCGCCA GGTCTCCGCC
GGCCTGCCGA CGAACAGCGC CGTGCAGATC GAGGGCGGGC GGATCCGCCT GGACCGGCTT
GCCGCCGCCC CCGACCCTGC CGGGATGGAG GCGGCCCGCG ACGCCGTGGC CGCGCTGCTG
CCCCGTGTCG ACTACGCGGA GCTTCTGCTG GAGGTCTTCG AGCGCACCGG CCTACCGGGC
GCGTTCACCC ACATCTCCGG TTCCGATGCC CGCCCCGCCG ACTTCGATTG GGGTGGCGGT
CTGGTCGCCG GCGCCGACCG GATGCGGTTC GTCGTCCCGG TCCGGTCGAT CGCCGCCCGG
CCCAGCGCGA AGTACTTCGC CGGCTCGAAA CGCCGCGCCG GCGCGACCTG GCTCAACGTT
GTCTCCGACA AGGTGATGGG CCTCGGGGGC CTGGTCGTGC CCGGCACCCC GCGCGACAGC
CTGCACATCC TCGATGCCCT GCACAACCTC GACGTCGACG AACCCCCGGA GATCATCACC
ACCGATACCG GGTCGTATTC GGATCTTGTT TTCGGCCTGT TCGCGATCAG CGGCTACCAG
TTCTCCCCGC GGATCGCCGA CCTCGCCGAC ACCCGACTGT GGCGCACCCG CACCGACGCC
GACTACGGGC CGCTCAACAC GGTGTCCGGG CACACGGTCA ACCTGGCCCG GATCGTCGAG
CACTGGCAGG ACATGCTCCG CGTCGCCGGC TCCCTGATCA CCGGGGAGGT CCGCGCCCAC
GACATGATCC GCATGCTCAC CCGCGAGGGC AACCCGACCG GGCTCGGCAA CGCCTTCGTC
GCCTACGGCC GGCTGTTCAA GACCCTCCAC GTGCTCCAGG TCCTGCACGA CGAGTCCTAC
CGGCGGATGA TCAGCGCTCA GCAGAACATC ACCGAAGGCC GACACAGCCT CGCCCGGCGG
ATCTTCTTCG GGAACCGCGG CGAGCTCCGC CAGCGGTACC AGACCGGCAT GGAAGACCAG
ATCGGCGTGC TCGGACTCGC CCTGAACTGC GTGGTCCTGT GGAACACCTG GTACATCGAC
GCCGCCGTCA CAGCCCTCGA AGCCGGCGGG ATGACCCTGT CCGCCGAGAT CCGATCCCGG
CTGAGCCCCC TGGTCTTCGA GCACATCAAC TTCCACGGCT CGTACCCGTT CGTCCGGCCC
GGTCTGGCCG GAGACCTACG CCCGCTACGG AACCCCACCG ACACCGACGA GCAGTGA
 
Protein sequence
MAIEYLSEEQ VRRYGGFARD PSPGELEQSL RMDRATLALV ASKRREANRL GWSVQWATVR 
MVGTFLTDPA EVPAVVAAFM AEQVGVVDPG CLKGYSTLEE MRTTVTAVSG RGLVGALDRV
SSVWAVGTGG VEVAAVPPVK LAELAAYGMV TKATTIRGLH DDRKVATVLA TVRHLEAVSV
DDALLLFDIL MATKLLARAE RVSGTKRLKT LPRFRQAAGR VAAAITVLLD VPQARDGQVM
TVAEMWTAIE QVVPREKLQA ALVTVAEYLP DEAEDDDADW RTELVTRYAT VTGFLELLAE
TIAWGATPAG APIVAALRDL PRVKARRAPE AAHIGEHAGL VTGSWRRLVY ANPQLSAPLI
DKHAYVFCVL EHLHRALRRK DVYALGADRW GDPRARLLDG DAWEQARPRG LSALGLPEQP
TDHLAELVCD LDAAYRQVSA GLPTNSAVQI EGGRIRLDRL AAAPDPAGME AARDAVAALL
PRVDYAELLL EVFERTGLPG AFTHISGSDA RPADFDWGGG LVAGADRMRF VVPVRSIAAR
PSAKYFAGSK RRAGATWLNV VSDKVMGLGG LVVPGTPRDS LHILDALHNL DVDEPPEIIT
TDTGSYSDLV FGLFAISGYQ FSPRIADLAD TRLWRTRTDA DYGPLNTVSG HTVNLARIVE
HWQDMLRVAG SLITGEVRAH DMIRMLTREG NPTGLGNAFV AYGRLFKTLH VLQVLHDESY
RRMISAQQNI TEGRHSLARR IFFGNRGELR QRYQTGMEDQ IGVLGLALNC VVLWNTWYID
AAVTALEAGG MTLSAEIRSR LSPLVFEHIN FHGSYPFVRP GLAGDLRPLR NPTDTDEQ