Gene Francci3_4011 details

Gene Information

Locus tag: Francci3_4011
Symbol:
ID: 3906972
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 4793298
End bp: 4796630
Gene Length: 3333 bp
Protein Length: 1110 aa
Translation table: 11
GC content: 68%
IMG OID: 637881340
Product: HsdR family type I site-specific deoxyribonuclease
Protein accession: YP_483090
Protein GI: 86742690
COG category: [V] Defense mechanisms
COG ID: [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases
TIGRFAM ID: [TIGR00348] type I site-specific deoxyribonuclease, HsdR family
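
The start, end, gene length, and protein length fields above can be cross-checked with a small sketch. It assumes 1-based inclusive coordinates and a single terminal stop codon; neither convention is stated in the record, and the function name `cds_lengths` is hypothetical.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumption: coordinates are 1-based and inclusive, and the CDS ends
    with exactly one stop codon that is not counted in the protein.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the stop codon
    return gene_len, protein_len


gene_len, protein_len = cds_lengths(4793298, 4796630)
print(gene_len, protein_len)  # 3333 1110
```

Under these assumptions the result matches the listed 3333 bp and 1110 aa.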


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.724918
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGGGG CTGCCGAGTT CGAGCTGGTT GAGCGTCCGC TTGTCGATCA GCTGCGCGGG 
TTGGGCTGGG CGTATCTTGC CGGGACGAAG CACGCCCCGG CCGCGAGCGG TCGGGCGTCG
TTCTCTGAGG TGCTGTTGGA GGGTCGGCTG CACGCGGCGC TGCGGCGGAT CAATCCAGGG
CCGGACGGCG AGTTGTGGCT TGATGATGGC CGGCTCGCGC AGGCCGTTGC CGCGCTGACT
CGGCCGAAGG CGACCAGGCT GGTCGAGATC AACGAGGAGT TGACGGAAGC GCTGTTGCTC
GGTGTGCCCG TCGAGGGGAT TCCGGAGGTC GATGGTGGTC GGAACCGTCG GGTTCGGTTC
ATCGACTGGG ACACGCCGTC GAACAACGAC TTTCTGGTGG TGAACCAGTT TCGGGTGGAC
ATCCCGGGCA GCCAGGGGCG CCGCTACATC GAGCCGGACA TCGTGCTGTT CGTCAACGGC
ATCCCGCTGG TGGTCGTGGA GGCGAAGAGC CCCACCACGT CCTCGCCGGT GGCGAAGGCG
ATCCGTCAGC TGGCCCGGTA CGCCGACCAG CGCGGGGCTG TCACCCGCGA GGGCAACGAG
CGGCTATTCT ACACCAACCA GTTCGTCGTC GCCACGAGCT GGGACGAGGC GAAGGTCGGC
ACGTTCACCT CGGACCCGGA TCACTTCGCC GAGTGGAAGA CAACCGAGCC GACTCCGGAG
ACCGAGGTCG CCGAAGCACT CGGCGTCGAC GGGCTGACCA GCCAGCAGCG GCTCGTCGCC
GGCATGCTCG CCCCGGAGCG GCTGTTGGAC ATCGTCCGGC ACTTCACGCT ATTCATGCCC
GCGCAGGCCG GGCGGACGAT GAAGATCGTC GCCCGCTACC AGCAGTACCG GGCGGTCCGG
CGCACCATCC ACCGCCTGAG CACCGGCAAG ACCCGGGTGG CCGACGGCGA GTTCGACCGG
CGCGGCGGGA TCATCTGGCA CACCCAGGGC TCGGGCAAGT CGCTGACCAT GGTGTTCCTG
ATCCGGGTCA TGCGGACGCA TCCCGATCTT GTCGGTTTCA AGGTCGTCGT GGTCACCGAC
CGGAGGGACC TCCAGGACCA GCTCGCGAAG ACCGCCGAAC TCACCGGAGA GACACCCCGC
ACGGCGAGGA ACGTCCAGCA GGTTCGCTCG CTGCTGTCGG TTCCGGGCAA GGCGCTGCTT
TTCGCGATGA TCCAGAAGTA CCGGAACCCG GACGCGGCGA AGGACGCCCC GCTGGAGGTG
AAGTCACTCG GAGTGCTGGA CCCGTCCGAG GGTGTCGTCG TGCTCGTGGA CGAGGCGCAC
CGCTCGCACA GCAGCACTCT GCACTCCGTG CTGCTGGACG CGTTGCCGAA CGCGGCACGG
ATCGGGTTCA CCGGGACGCC GATCATCATG GGTAAGCGGA AACGCACCCA TGCGATCTTC
GGCCCGTATC TGGACCGGTA CACCATCAGC GAGTCTGAGG CGGATGGCGC GACCGTGCCG
ATCCGCTACG AGGGCCGTAC GACGAAGAGC GACGTCCAGG ACTCGGCGGA CCTCGACGAG
CTGTTCGAGG ACATGTTCCC CGACCTCACC GATGCGCAGC TGGCGAAGAT CCGACGTCGA
TGGGGAACCA TCGGCAACGT GCTGGAAGCG GAGAAGATGA TCACGGCGAA GGCCCGCGAC
ATGCTCCGCC ACTATGTCGA CACAGCCTTG CCCGGCGGGT TCAAGGCGCA GATCGTCGCG
ACCAGCCGGC TCGCCGCGGT GCGCTACCGC GACGCCCTGC TGGCCGCTCG TGATGATCTC
GTCGCCCGGC TCGATGCACT GCCCGCCGAG TTGCGCACCC CCGAGGCCGC GGACCGGGCC
ACCTCCCCCG AGGGCATCCC CGGCCTCGGG CGCGCCCGGC TGCGCGACGT CCGGGCCTGG
CCGTACCGCG ACCTGATCGC TCGGCTCGAT TTCGTCCCGG TCATCTCTGG TGAGCAGAAC
GAGAAGGACT GGCGCGACTG GATCGACGAG ACCCGGCAGA AGGTGGTCAT CGAGGAGTTC
AAAAAGCCGC TGCCCGCACC CGATGACCCG GCACCGGACC CGGCCACGAC CAGCACGGTT
GCCTTCCTGC TGGTTAAGTC GATGCTGCTG ACCGGCTTCG ACGCCGCCGT CGACCAGGTG
ATCTACCTCG ACCGACGGAT CAAGGAAGCT GAACTGCTCC AGGCCATCGC TCGCGTGAAC
CGCACCGCTC GGGGGAAGGC CAACGGCTAC GTCGTCGACT ACTTCGGCGT CGCCAAGCAT
CTGCACGCCG CGCTGGAAGC GTATGCGGCC GAGGACATCG ACGGCGCGCT GGCGAGCATC
ACCGACGAGC TTCCACTGCT GGCCGACCGG CACGCCAGGG TCCGCGCGCT GTTCACAGAC
CGCGGCCAGG AGCGTTTCGA CACCCCGGCA GATCAGGAAG CCTGCGTCCA GATCCTGGCC
GACGACGCGC TGCGGGCCGC GTTCCAGGTC GCCTACCGGG CGCTGACCCG CAGCCTGGAG
ACGGTCCTGC CACGACCGGA AGCGCTGCCC TATGCGGCCG ACACCAAGGC GTTCGGAGTG
ATCGGCCTGC TCGCACGGCG ACGCTACGAC CGCGATGATC CGGACTTCGA CGTCTCCGTT
TACGGCGAAA AGGTACGCCG CCTGATCGAC GACCACATCG TCGCGCTCGG CATCAGCCAG
AAGATCCCAC CTGTGTCGCT CACCGACGCC CGCTTCGACG AGAAGGTCGG CGGGCTGACG
TCGAAGCGGG CGAAAGCGTC GGAGATGGAG CACGCCCTGC GCCACCACAT CAGCGGCATG
CTCGACGCCG ACCCGGTTCG CGCGCGCACC CTGTCCCAGC AGCTGTCGGA GATCCTCGAC
CGGCTCCGTG ACCAGTGGGA CCAGCTCGTC GCCGAACTCG GCGACCTGAT CGACAAGGCC
CGCGCGGGAC GGACCACCGC CGAGGATCCC GACGACGCGC CCGACGGTGT CCAGCTGACT
CCGATCGAAC GCCTCTTCTT CGACATCCTA CGCGCCGAAC GGGTCGCCGA GGGCAAGGAG
ATGACACCAG TCGCCATCGA GGCCGTCGCC GAGCTCGTCA CCTTCATCGT CGATCATCTG
TGTCGGGAGA TCGGGAGGGT GCACTTCTGG GGGAACGCCC ACGCCCAGCA ACGCCTGCAC
AGCTGGATCA CGGTAGCGGT CAGCGACGTG TCCATCGACG GCGACGACCT GTTCGACCAG
GACCGGGCGG AGGCCATCGC CGACCAGATC GTCGAACTGG CACGGCACAA CCACGCTGCG
GTTGTCACGA CAGTCCCGCC CACGGTCAGC TGA
 
Protein sequence
MTGAAEFELV ERPLVDQLRG LGWAYLAGTK HAPAASGRAS FSEVLLEGRL HAALRRINPG 
PDGELWLDDG RLAQAVAALT RPKATRLVEI NEELTEALLL GVPVEGIPEV DGGRNRRVRF
IDWDTPSNND FLVVNQFRVD IPGSQGRRYI EPDIVLFVNG IPLVVVEAKS PTTSSPVAKA
IRQLARYADQ RGAVTREGNE RLFYTNQFVV ATSWDEAKVG TFTSDPDHFA EWKTTEPTPE
TEVAEALGVD GLTSQQRLVA GMLAPERLLD IVRHFTLFMP AQAGRTMKIV ARYQQYRAVR
RTIHRLSTGK TRVADGEFDR RGGIIWHTQG SGKSLTMVFL IRVMRTHPDL VGFKVVVVTD
RRDLQDQLAK TAELTGETPR TARNVQQVRS LLSVPGKALL FAMIQKYRNP DAAKDAPLEV
KSLGVLDPSE GVVVLVDEAH RSHSSTLHSV LLDALPNAAR IGFTGTPIIM GKRKRTHAIF
GPYLDRYTIS ESEADGATVP IRYEGRTTKS DVQDSADLDE LFEDMFPDLT DAQLAKIRRR
WGTIGNVLEA EKMITAKARD MLRHYVDTAL PGGFKAQIVA TSRLAAVRYR DALLAARDDL
VARLDALPAE LRTPEAADRA TSPEGIPGLG RARLRDVRAW PYRDLIARLD FVPVISGEQN
EKDWRDWIDE TRQKVVIEEF KKPLPAPDDP APDPATTSTV AFLLVKSMLL TGFDAAVDQV
IYLDRRIKEA ELLQAIARVN RTARGKANGY VVDYFGVAKH LHAALEAYAA EDIDGALASI
TDELPLLADR HARVRALFTD RGQERFDTPA DQEACVQILA DDALRAAFQV AYRALTRSLE
TVLPRPEALP YAADTKAFGV IGLLARRRYD RDDPDFDVSV YGEKVRRLID DHIVALGISQ
KIPPVSLTDA RFDEKVGGLT SKRAKASEME HALRHHISGM LDADPVRART LSQQLSEILD
RLRDQWDQLV AELGDLIDKA RAGRTTAEDP DDAPDGVQLT PIERLFFDIL RAERVAEGKE
MTPVAIEAVA ELVTFIVDHL CREIGRVHFW GNAHAQQRLH SWITVAVSDV SIDGDDLFDQ
DRAEAIADQI VELARHNHAA VVTTVPPTVS
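
Both sequences in this record are printed in space-separated blocks of ten characters. A minimal sketch of how such a block listing might be joined into one string and its GC percentage computed is given here; the helper names `parse_blocked_sequence` and `gc_percent` are hypothetical, and the demo uses only the first and last lines of the gene sequence, so its GC value is not the whole-gene figure.

```python
def parse_blocked_sequence(text: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one string."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(base in "GC" for base in seq)
    return 100.0 * gc / len(seq)


# First and last lines of the gene sequence from this record, as a small demo.
demo = """
ATGACCGGGG CTGCCGAGTT CGAGCTGGTT GAGCGTCCGC TTGTCGATCA GCTGCGCGGG
GTTGTCACGA CAGTCCCGCC CACGGTCAGC TGA
"""
seq = parse_blocked_sequence(demo)
print(len(seq), round(gc_percent(seq), 1))
```

Applied to all lines of the gene sequence, the same two calls would give a value to compare against the GC content field.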