Gene Francci3_1258 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1258 
Symbol 
ID3906104 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp1501855 
End bp1504473 
Gene Length2619 bp 
Protein Length872 aa 
Translation table11 
GC content76% 
IMG OID637878592 
ProductComEC/Rec2-related protein 
Protein accessionYP_480365 
Protein GI86739965 
COG category[R] General function prediction only 
COG ID[COG0658] Predicted membrane metal-binding protein
[COG2333] Predicted hydrolase (metallo-beta-lactamase superfamily) 
TIGRFAM ID[TIGR00360] ComEC/Rec2-related protein
[TIGR00361] DNA internalization-related competence protein ComEC/Rec2 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0672087 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGACCG GACTCGCGCT GCCGGAACGG CCGTCCGGGG CTTCCCGTCC CATGGACGCC 
CGGCTGGTGG GCCCGGCGGT CATGGCCTGG GCTGGCGCGG CGGGTGCCGG CACCTGGCCG
CCGTCCGTTC CGTTGCTGGG CGCCGCCGGT GCGGTGCTCG TCGCGGCTCC GATGTTCCTG
CTGCTCCAGG TCCGTCCGCG GGGTGGTCGG CCGACTCGCC CGGGACCGGG CGGCGCTCGG
GCCGAGGCGG CTACGGGCGC GTCTCGTCCG TCGCAGTCGC AGTCGCAGCC GCGCAGCGCG
CCGCTCACCC GGGGCGTTGG GCGGTGCTCC GGCGGCACTG TCGTCGTGCT CACGGCCCTC
GTCTTCCTGG CCGCCGGGTT TCTCGCGGGT GGACTGGCGG CTCGTCCGCG TGACAGCGGC
CCGCTCGCCG ATCTGGTCCG GCATGGCCGT CCGGTCACCG CCGAGGTCGT CGTCACCGAC
GATCCCCGCG CCACCTCCTC GTCCACCGCG GTCCGGACGG GAACCGGGCC CGGAGCCGGA
GAGGGAAGAG GAGAGGGCGG CGGCGCCGAT CGCGTGACCT TCGTGATCTC CGTCCGGGCG
GAACGCCTTA CCGCGTCACC GGCGGATCTG CGGGTCCGGG CCCCGATGGT CCTGCTCGCG
CGGGGTGGCG GCTGGGGTGC TCTGCTGCCC AGCCAGCACC TGGTCGTCTC CGGCAGGCTC
GCCGAACCCC GGGTCGGTGA CACCGTGGCC GCCGTCCTGT TCGCCGACGG CCCACCGCGG
ACCCGCGGCG GACCCGCAAT GATCCAACGA ATCGCCGGGG GGCTGCGGGC CGGGCTGCGT
CGAGCCGCTG GGGGACTGCC TGAACCGCGT CGCGGGCTGC TGCCCGGCCT CGTGGTCGGC
GACGTCTCCG GGCTCGACGA TGCGGTGCGC GCGGACTTCC GCGCCGCCGG AATGTCCCAT
CTAACGGCCG TGTCCGGTAG CAACGTCGCC ATCGTGACGG CCTCGGCGCT CTACCTCATG
GGATGGACCG GGCGTGGGTC ACGGTCGCGC GCCGCGGTGG GAGCCCTCGC TCTCGTCGGG
TTCGTCGTGC TCGCCCGGCC CTCGGCCAGC GTGCTGCGTG CCGGGGCGAT GGGTCTGGTC
GGCCTGCTCG GGCTCGCCGT TGGCCGTCCC AGGGCGGTGC TGCCCTCCCT GGCGGCGAGC
GTGATCATCC TGATCCTGGC GGATCCCGCG CTCGCCCTCT CGGTCGGCTT CGCGCTGTCC
GTCCTCGCCA CGGCCGGAAT GATCGTTCTC GCGCCGGGCT GGCGGGACGC GCTCGCTCGC
CGGCTGCCGG CGCGAATCGC CGAGGTGCTC TCCGTCGCGG CCGCCGCCCA GCTGGCGTGC
ACCCCGGTGT TGGCCTGGAC CGGCGGGGGG CTGAGCCTCG TGGCGGTGCC CGCGAACGTC
CTGGCCGTGC CCGCCGTCGC ACCGGCGACC GTGCTGGGAG TACTGACGCT GGTCGTGGCG
GCGGTGTCGC CGTCAGCAGC CGGGCTGCTC GCGCACCTCG CCGATCTGCC GTGCTGGTGG
CTGGTGACGG TCGCCGGCCG GTGCGCCGAT CTTCCTGCCG CGACGCTGCC GTGGCCCACG
GGGGTGATGG GAGCGGGCGT GGCCGCCGGT GTCGCCTGGC TCGTGGTGGC GACGCTGCGG
CGGCGGGTGC CGCGGCGACT GGTCGCTGCG GCCCTGGTCG GCCTGCTCCT GGCGCGATGC
GCCGTCGCCG GGCGGCTGGC GCCGTGGCCG CCGCCCGGCT GGCGCCTGGT CGCCTGCGAC
GTCGGTCAGG GCGACGCCCT CGTGCTGTCG GCGGGACCGG GCACCGCGGT GCTGGTCGAC
GCCGGCCCCG ATCCGGCCCT GCTGACCCGC TGCCTCTCCG ACCTCGGCGT GCGACGGATA
CCGGTGGTCA TTCTCAGCCA CTTCCACGCC GATCACGTCG AGGGGCTGCC GGCCGTGCTC
GGCCGGCTTC CGGTGGGGGA GGTACTCGGC AGTCCGCTGG GGGAACCCGT TCTCCAGTGG
CACCGGGTCC AACAGTGGAC CCGGCGGGCC GGTGTGCCGC TGCGGACGGC CGTCATCGGC
TCCCGGGCAC AGGTGGGGGC GGTCTCTTGG ACGGTCCTGG CTCCGCGCAC CGTGTTGCAC
GGCACCGAGA GCGATCCCAA CAACGCGAGC CTGGTGCTCT CCGCACGGGT GGGGGAGGTG
ACCATCCTGC TCACCGGGGA TGTGGAGCCG CCGGCCCAGC GGGTGTTGAC CGGCAGCCCG
GAGGATCGGA CGGCGCTACG AGCCGACGTG CTCAAGGTGC CCCATCACGG CGCCGCTGAT
CAGGACGCCA CCTTCCTCGC GGCGACCGGG GCCCGATTCG CCCTGATCAG CGTCGGCACC
GGGAACAGCT ATGGTCATCC CGCGCCGTCC ACCTTGCGGA CGCTGCGCCG GTCCGGCATG
GCCGTCGCCC GGACCGACCG CGACGGCGCG GTGGCGGTGG TCGCCACGGC GGCACCCGCC
GGGTCCGCCG AGTCCGGGTC GCTGGCGGGA GCCGCCTCCG GGTCGGGGGC GGGAGTCCGC
GTCTCGGTGG TGCTCCGCCG GCCCGGGGGC GGATCGTGA
 
Protein sequence
METGLALPER PSGASRPMDA RLVGPAVMAW AGAAGAGTWP PSVPLLGAAG AVLVAAPMFL 
LLQVRPRGGR PTRPGPGGAR AEAATGASRP SQSQSQPRSA PLTRGVGRCS GGTVVVLTAL
VFLAAGFLAG GLAARPRDSG PLADLVRHGR PVTAEVVVTD DPRATSSSTA VRTGTGPGAG
EGRGEGGGAD RVTFVISVRA ERLTASPADL RVRAPMVLLA RGGGWGALLP SQHLVVSGRL
AEPRVGDTVA AVLFADGPPR TRGGPAMIQR IAGGLRAGLR RAAGGLPEPR RGLLPGLVVG
DVSGLDDAVR ADFRAAGMSH LTAVSGSNVA IVTASALYLM GWTGRGSRSR AAVGALALVG
FVVLARPSAS VLRAGAMGLV GLLGLAVGRP RAVLPSLAAS VIILILADPA LALSVGFALS
VLATAGMIVL APGWRDALAR RLPARIAEVL SVAAAAQLAC TPVLAWTGGG LSLVAVPANV
LAVPAVAPAT VLGVLTLVVA AVSPSAAGLL AHLADLPCWW LVTVAGRCAD LPAATLPWPT
GVMGAGVAAG VAWLVVATLR RRVPRRLVAA ALVGLLLARC AVAGRLAPWP PPGWRLVACD
VGQGDALVLS AGPGTAVLVD AGPDPALLTR CLSDLGVRRI PVVILSHFHA DHVEGLPAVL
GRLPVGEVLG SPLGEPVLQW HRVQQWTRRA GVPLRTAVIG SRAQVGAVSW TVLAPRTVLH
GTESDPNNAS LVLSARVGEV TILLTGDVEP PAQRVLTGSP EDRTALRADV LKVPHHGAAD
QDATFLAATG ARFALISVGT GNSYGHPAPS TLRTLRRSGM AVARTDRDGA VAVVATAAPA
GSAESGSLAG AASGSGAGVR VSVVLRRPGG GS