Gene Francci3_4145 details


Gene Information

Locus tag: Francci3_4145
Symbol:
ID: 3907110
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 4945748
End bp: 4947202
Gene Length: 1455 bp
Protein Length: 484 aa
Translation table: 11
GC content: 73%
IMG OID: 637881473
Product: transposase IS66
Protein accession: YP_483222
Protein GI: 86742822
COG category:
COG ID:
TIGRFAM ID:
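
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are inclusive and that the gene length includes one terminal stop codon; the variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 4945748
end_bp = 4947202
gene_length_bp = 1455
protein_length_aa = 484

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated,
# so the protein has one residue fewer than the number of codons.
codon_count = gene_length_bp // 3
expected_protein_length = codon_count - 1

print(span_bp == gene_length_bp)                     # True
print(expected_protein_length == protein_length_aa)  # True
```

Under these assumptions, both reported lengths agree with the coordinates: 1455 bp is 485 codons, of which 484 encode amino acids.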


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCGTCTGC TTGCGGACCG GGACGCCACC ATCGCCGCCC AGGCGGCCAC GATCGCGGAG 
CAGGCGCGGC TGGTCGAGCG GCTGGTCGGG CAGGTGGAGC AGCTCACCGA CAGGGTCCGC
GAACTGGACC GCCAACTCGG CAGGGACAGT ACGAACTCGT CGTGGCCGTC GTCGTCGGAC
AGCCCGTATA CGAAGAAGAA GGCCAAGCCC CGGTCGTCGC GGACGTCGAT GGGCCGGCCG
AGGGGTAAGC AGCCCGGCGC GACGGGCGCG ACCCGTCAGA TGGTCGACGA CCCGGACGAG
ATCCACACGA TCGACCCGTC GTTATGCGCG GACTGCGGGT TCCCGCTGGC CGGAGCGGCG
AGGCTCACGA CGCGGCGCCA CCAGATCTTC GACCCGCCGC CCCCGCCGCG CCCGTATGTC
ATCGAGTACC GGATCGTGAC GCGGGTCTGT CCGTGCTGCG CGGCGACGAC CGAGGGGCTG
ACACCCGTCC CGCTGGCGGG CCGGCTTGTC TGGGGCCCGC GGATGCTCGC GCGAGCGGTG
TGGCTCGTGT GCGCGCACCA CCTCCCGATC CGCCGCGCCG CGGCGGTCCT GACGGTGATG
GTCGGCGCGA CGGTCTCCGC CGGCTGGGCC GGCGGCGTGC GAGCCCGCGC CGCGCGTCTG
TTGGAGAACA CCTTCCTCCC GCACGTGCGG GCGTTGATCG CCGCCGCGCC GGTCGCGCAC
GCCGACGAGA CGACCGCCCG CGCCGACGGC GCGCTGCGCT ACGTCCACGT CGCCGCCACC
GACTACCTGA CCGCGCTGCA CACCGGTGAC CGGACCGCCG AGACGATCGA CGCCGGCGGG
ATCTGGCCGG CGTTCACCGG CGTGCTGATG CGGGACGGCT ACCAGGGCTA CACCCACCTC
ACCCGGGCGC TGCACGCCTG GTGCGGCGCG CACACCCTGC GCGACCTGCG GTCCATCCAT
GACGGCGACC GTGGCGGGCA GGTCTGGGCC GACGCGATGG CGACCACCCT GCTCGACGCC
CACCACGCCG CGTGCGACGC CCGCGACGCC GGGGCGAACG CGCTCGCGCC CGAGGCCGTC
GCCCTCATCC GCAACCACTA CCGCGGCGCG CTCGCCCGCG GCGAGACCGA CAACCACGGG
GACCGCTCGT CACTCGCCCA CGACGCTCGC ACACTGATCC GCCGGATGCG CCGCGAGGAG
GACATGATCC TCCGCTTCGT CGTCGACCTG ACCGTGCCCT TCTCGAACAA TCAAGCAGAA
AGGGACGTCA GGCCGGTCAA GGTCCAGCAA CGCACCTCCG GTGGTTGCTG GCGGACCCTG
GCCGGCCTCG TCGACTTCGC GGTCGTGCAG TCCTACCTGT CGACCGCGAC CAAGTGGGGC
CTCGACACCC TCGACGTTCT CGAACGACTC TTCACGACCG GCCCCTGGCT ACCGCCCGCC
GCTGAACCCG GCTGA
 
Protein sequence
MRLLADRDAT IAAQAATIAE QARLVERLVG QVEQLTDRVR ELDRQLGRDS TNSSWPSSSD 
SPYTKKKAKP RSSRTSMGRP RGKQPGATGA TRQMVDDPDE IHTIDPSLCA DCGFPLAGAA
RLTTRRHQIF DPPPPPRPYV IEYRIVTRVC PCCAATTEGL TPVPLAGRLV WGPRMLARAV
WLVCAHHLPI RRAAAVLTVM VGATVSAGWA GGVRARAARL LENTFLPHVR ALIAAAPVAH
ADETTARADG ALRYVHVAAT DYLTALHTGD RTAETIDAGG IWPAFTGVLM RDGYQGYTHL
TRALHAWCGA HTLRDLRSIH DGDRGGQVWA DAMATTLLDA HHAACDARDA GANALAPEAV
ALIRNHYRGA LARGETDNHG DRSSLAHDAR TLIRRMRREE DMILRFVVDL TVPFSNNQAE
RDVRPVKVQQ RTSGGCWRTL AGLVDFAVVQ SYLSTATKWG LDTLDVLERL FTTGPWLPPA
AEPG
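
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes translation table 11 uses the standard codon assignments, with an alternative start codon such as GTG read as methionine at the first position. The function name `translate_cds` and the start-codon set are illustrative choices and are not part of the record.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Assumption: translation table 11 shares these assignments and differs
# from the standard table only in its permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: start codons accepted at the first position of a bacterial CDS.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    # Remove the spaces and line breaks used to group bases in the listing.
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        # An alternative start codon at position 0 is read as methionine.
        if i == 0 and codon in START_CODONS:
            aa = "M"
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGCGTCTGC TTGCGGACCG GGACGCCACC"))  # MRLLADRDAT
```

The output matches the first ten residues of the protein sequence, including the GTG start codon read as M.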