Gene Francci3_3104 details


Gene Information

Locus tag: Francci3_3104
Symbol:
ID: 3904230
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 3675959
End bp: 3677713
Gene Length: 1755 bp
Protein Length: 584 aa
Translation table: 11
GC content: 74%
IMG OID: 637880425
Product: hypothetical protein
Protein accession: YP_482190
Protein GI: 86741790
COG category: [L] Replication, recombination and repair
COG ID: [COG0322] Nuclease subunit of the excinuclease complex
        [COG0847] DNA polymerase III, epsilon subunit and related 3'-5' exonucleases
TIGRFAM ID: [TIGR00573] exonuclease, DNA polymerase III, epsilon subunit family
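
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not say so explicitly, but the listed gene length agrees with that reading) and that the CDS ends with a single stop codon. The function names are illustrative only.

```python
# Values copied from the Gene Information record.
START_BP = 3675959
END_BP = 3677713
GENE_LENGTH_BP = 1755
PROTEIN_LENGTH_AA = 584


def span_length(start: int, end: int) -> int:
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print("span length:", span_length(START_BP, END_BP))
    print("expected protein length:", expected_protein_length(GENE_LENGTH_BP))
```

Under these assumptions both computed values match the Gene Length and Protein Length fields of the record.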


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.197929
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.651768
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCGCCCAG ATCCCGCCCC GGACCGTCCC CGGCCGCCCC GACAGGGCAG GCTGGATGAT 
CTCGGTCGTC CGTTGGCGGA CGTGACCTTC GTCGTGTTCG ACCTGGAGAC GACGGGCACC
TCCCCGGGAC GCGACGAGAT CACCGAGATC GGCGCGGTCC GGGTCCGCGG CGGCCGGATC
CTCGCCGAGA TGGCCACCCT GGTCCGGCCG GGCGTCGGCA TCCCTCCGAT GGTCTCGGTG
CTCACCGGCA TCACCGACGT GATGGTGGCG ACGGCGCCGC CGGTCACGCA GGTGCTGCCT
ACCTTCCTGG AGTTCGCCCG CGGCGCCGTT CTCGTCGCCC ACAATGCCCC GTTCGACCTC
GGCTTCCTGC GCGCCGCCGT CGAGCTCTGC GGCTATCCGG TACCCGTCTG GGAGTATCTG
GACACGTTGC GTATCGCCCG GCGGGTGGTC ACGAGAGACG AGAGCCCCGA CTGCCGGCTC
ACGTCGCTGG CCTCGCTGTT CCGCAGCCCG GTCGAGCCCC GCCACCGGGC GCTGGCGGAC
GCCCGGGCCA CCGTCGACGT GCTGCACGGG CTGTTCGAAC GGCTCGGCAA CGCGGGCGTG
ACCACCCTGG AGGACCTGCA CGACTACAGC TCCCGGGTGT CGCCGGCCCA GCGACGCAAA
CGGCATCTGG CCGACGGCCT GCCGACGGGC CCGGGTGTCT ACATCTTCCG GGACGCCGAC
GAACGAGCCC TGTATGTCGG CACCTCGCGT TCGGTGCGCT CCCGGGTCCG TACCTACTTC
ACCGCCAGCG AGCCCCGGAC GCGGATGGCG GCGATGGTGG CGCTGGCCGA GCGGGTTGAC
GCGATCGGAT GCGCGCACGC CCTGGAGGCC GAGGTCCGGG AACTGCGGCT GATCGCCGAG
TACAAGCCGC CGTACAACCG CCGATCCCGC TTCCCGGAGC GCTCCGTGTA TCTCAAGCTC
ACCGACGAGC CATTCCCACG GCTTTCACGG GTGCGCGCCG CCCGTGACGA CGCGACCTAT
CTCGGGCCGT TCGGCAGCGT CCGCGCCGCC GACGCCGCCG CCGAGGCGCT GCTGGCCGCG
GTGCCGCTGC GCCAGTGCTC CGGGCGCCTG TCCCCGCGCG TGCGCCGGTC CGCCTGTACC
CTCGCCGACC TCGGCAGATG CGGAGCGCCG TGCGACGGCC GGGAGGACGT GGCGAGCTAC
GGCCGGCACG TCGCCGCCGC CCGGGCCGCC ATCACCGGCG ATCCGGGCCG GGTCATCGCC
GCCTCGACGC GGCGGATCGA CCGGCTGGCC GCCGAGCGGC GCTACGAGGA GGCCGCCGTC
CAGCGGGACC GGATGATCGC GTTCGTCCGC GCAGCCGCAC GTGCCCAGCG GCTGTCGGCC
CTCACCGGGG TCGCCGAGCT CGTCGCCGCG GCCCCGACCG CCGAGGCCGG CTGGGATCTG
GCCGTAGTGC GTCACGGTCG GCTGGTGTCG GCGGCGAGCG TGCCGCCCGG TGTCGATCCG
CGGCCCTGGG TCGACGCCGC GGTCGCCAGC GCCGAGACGG TGCGGCCGCG GCCCGGTCCG
GCCCCGTGCG CCTCCGTCGA GGAGACCGAA CGCATCGCCC GTTGGCTCGG TGGGCCTGGG
GTGCGGCTCG TGCGGCTGGA GGGCGAGTGG AGCTGGCCGG CCGCGGGCGC CATCCGCGCC
GCAGCCGGAT TCGGTGCGGC CCCCGGCCGG TCGGTACGTG CGTATGACGG TGACGGATGG
TTCCCGTCGG CCTGA
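
The gene sequence is printed in blocks of ten bases, so it needs to be stripped of whitespace before any statistic such as GC content is computed. The following is a minimal sketch of that step; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of G and C bases in a (possibly block-formatted) sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


if __name__ == "__main__":
    # First line of the gene sequence, copied verbatim from the record.
    first_line = "GTGCGCCCAG ATCCCGCCCC GGACCGTCCC CGGCCGCCCC GACAGGGCAG GCTGGATGAT"
    print("bases:", len(clean_sequence(first_line)))
    print("GC fraction:", round(gc_fraction(first_line), 3))
```

Passing the full gene sequence to `gc_fraction` gives a value that can be compared against the GC content field of the record.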
 
Protein sequence
MRPDPAPDRP RPPRQGRLDD LGRPLADVTF VVFDLETTGT SPGRDEITEI GAVRVRGGRI 
LAEMATLVRP GVGIPPMVSV LTGITDVMVA TAPPVTQVLP TFLEFARGAV LVAHNAPFDL
GFLRAAVELC GYPVPVWEYL DTLRIARRVV TRDESPDCRL TSLASLFRSP VEPRHRALAD
ARATVDVLHG LFERLGNAGV TTLEDLHDYS SRVSPAQRRK RHLADGLPTG PGVYIFRDAD
ERALYVGTSR SVRSRVRTYF TASEPRTRMA AMVALAERVD AIGCAHALEA EVRELRLIAE
YKPPYNRRSR FPERSVYLKL TDEPFPRLSR VRAARDDATY LGPFGSVRAA DAAAEALLAA
VPLRQCSGRL SPRVRRSACT LADLGRCGAP CDGREDVASY GRHVAAARAA ITGDPGRVIA
ASTRRIDRLA AERRYEEAAV QRDRMIAFVR AAARAQRLSA LTGVAELVAA APTAEAGWDL
AVVRHGRLVS AASVPPGVDP RPWVDAAVAS AETVRPRPGP APCASVEETE RIARWLGGPG
VRLVRLEGEW SWPAAGAIRA AAGFGAAPGR SVRAYDGDGW FPSA
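
The protein sequence begins with M although the gene sequence begins with GTG, which is what translation table 11 (the bacterial, archaeal and plant plastid code) produces for an alternative start codon. The sketch below illustrates this under stated assumptions: it uses the standard codon assignments, treats only ATG, GTG and TTG as initiators (a simplification of table 11, which lists further start codons), and stops at the first stop codon. `translate_cds` is a hypothetical helper, not part of any library.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 uses the same
# amino acid assignments and differs only in its set of start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a reduced set of initiator codons, for illustration only.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str, is_start: bool = True) -> str:
    """Translate a CDS, reading an initiator first codon as M."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and is_start and codon in START_CODONS:
            aa = "M"  # initiator codon is read as methionine
        if aa == "*":
            break  # stop codon ends translation
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases and last 15 bases of the gene sequence in the record.
    print(translate_cds("GTGCGCCCAG ATCCCGCCCC GGACCGTCCC"))
    print(translate_cds("TTCCCGTCGG CCTGA", is_start=False))
```

The two sample fragments correspond to the first ten and last four residues of the protein sequence listed above.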