Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_3777 |
Symbol | |
ID | 5672142 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 4478201 |
End bp | 4481512 |
Gene Length | 3312 bp |
Protein Length | 1103 aa |
Translation table | 11 |
GC content | 79% |
IMG OID | 641242658 |
Product | exonuclease SbcC |
Protein accession | YP_001508078 |
Protein GI | 158315570 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0419] ATPase involved in DNA repair |
TIGRFAM ID | [TIGR00618] exonuclease SbcC |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00618704 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCCCGG TGCAGCTGGA CCTCGCCGGT TTCGGATCGT TCCGGGAACC GGCGGTGCTG GACCTCACCG ACGCCGACTA CTTCGCGCTG GTGGGCCCGA CCGGCGCCGG CAAGTCGACG CTGATCGACG CGCTGACGTT CGCGCTGTTC GGCTCGGTGC CGCGCTGGAA CAACCGGGCG ACGGTACATC TGGCGCTCGC GCCGACCGCG GCCCGCGGCA CGGTACGTCT CGTCTTCGAC GCGGCCGGCG CCCGCTACGT GGTGGCGCGG GAGCTGCGCC GCGCCGCCCG CGGGGGTGTG ACCGTGCGCA ACGCCCGGTT CGAGCGGCTG GTCGACCCGG CCGGCGCGGG CGGGCCGGGG GAGCCGACCG ACGTGCTCGC CGCGGGCGCG CCCGCGGTCA CGGCGGCGGT CGAGGAACTG CTCGGGCTCA CCTTCGAGCA CTTCTGCACC TGCGTGGTGC TTCCGCAGGG CGAGTTCGCC GAGTTCCTGC GGGCGAAGCC GAGCGAGCGG CAGGCCATCC TCACCCGGCT GCTCGGGCTG GGCGTCTACG ACACGATCCG CGCGGCGGCG AACGCCCGCG CGAGCGACCA GCGCCAGCGC GCGGACGTCC TCGCCGAGCA GCTCGCCGGG TACGCGGACG CGACCGCCGC TGCCGAGGCC GCCGCGGCCG GCCGGGTCGG CGCGCTCACC CGGCTCGCCG CCGAGGTGGC CGCCGCCCTC GCCGAGGTGG CGCGGCTGGA CACCGCCGCC GCCGAAGCCG CCGCCCGGAC CGCCAGACTG GGCGCCGAGC GCGACCAGCT CGCCGCCGTC GAGCCGCCGG CCGGGCTGGT GGACCTCACC GGGCGGCTGC GCGCTGCCCG GGCCGCCGAC GCGGACGCCC GTCGGCGCCG CCAGGAGGCC GAGCGGGCGG ACAGCGCCGC GCGGGGAGAT CGCGCCGCCG CGCAGCCCCG TGCTCCGCTC GACCGGTACG CCCGCGACCA CCGGGAGCTC GCCGACCTGC TCGCGACCAG ACCGGCGGCC GTCACCGAGC ACGCCGAGGC CGACGCCGCG CTCGCCCGGG CGGACGCCGA ACTCGGGGCG GCGCGCGCCG CGCTGGCGGC GGCGACCGCC GCCGAACGCG ACGCGGAGCA GGCCGCTGTA TCCGCTGCCG CGGAGTTGAG CCGGCTGGAC TCCGAGCTGG GGCTGCTCGC CGATGTCCGG GCGCCGGCCG GGCTGGCGGC CCTGACCGGC CGCCTGCGCG GGGCCGAGGC GGCGCGGGCC GAGGCCTTGG AGCGCCGGGA GGCGGCCGAG CGGGCTGACA CCGCCGTCCG CGAGGAGCGG GCCGCCGCAC CGTCACGCGG CCCGCTGGAG CGGGCGCTGC GCGATCATGA GCTGCTCGTC GAGCTGCTGG CCGGCCAGGC CGCTGCGGCG GCCGCGCACG CCGACGTCCA GGCGGCGCTC GGCCGGGCCG ACGCCGCACT GGAAGAGGCC CGGCACACCC GGCACCGGGC GGCCGGAGCC CGGGATGATG CCGCCCGCGC CGACCTGGCC GCCGCGTTGC GGCCGCACCT TGCCGTGGGG GAGGACTGCC CGGTCTGCGC GCGGCCCGTC GCGACGCTGC CGCCCCCGCT GCCGGCCGCC GGCCTCGGTA CCGCGGACGA GGCGGTCACC GCCGCCGAAC GCGAGGTGAC CGCCCGCGCC GCCGAGGCGA CCGCCGCGAC CCGGGCGGCG GCGGAGTCCG AGGCCCGGCT CGCGGACCTG ACCAGGCGGG TCGAGGAGCT GCGCCGCGCG GTCGCCGACG CGCCGGACGC CCGCCGCGCG GCCGGCCTCC TCGCGGACGT CGAACGGCTC GACGGCGCCG TCGCCCGCGC GGACCTGGCC CTGCGCCGGG CCCGCGACGC CGCGGCCGCC GCCGACACGG TCGTCACCCG CTCCCGGGAC GGGGCGGCCG GGGCGCGGCG GGCGCTTGAC GCCGTCCGCG ACCCGCTGGT CTCCCTCGGC GCGCCCGGGC TGGCGGGCAC CGACCTGGAG GCGGACTGGG CCACGCTGGT CACCTGGGCC GCCGAGCTCG CCCGCGACCG GACCGGTCAG CGTGACGCCG CGGCCGAGCG TTCGGCCGCC GCCACGGCCT CCCTGGCCGC CGCGCGGCAG GCGCTGGCGC TCGCCTCCGC CGGGCAGGCG AGGGCGGACG AGAACCGGAT CGCCGAGGCC CGCGCCCAGC AGCGGGCCGC CACCCGGGTC GCGACCCTCG ACGACCGGGC CGAGGCCCTG CGTGTCGTGC TCGCCAGCGC GCCGACCGCC GTCGAGGTCG CGGCCGCGCT CGCCGAGCTG GACCGGCTCG CCGCTACCGC CGACCAGGCC GACCAGCGGC TGCGCGCCGC CCGGGCGGAG GCCGACGACG CCGAGCGGGC GTTCGAGCGG GCCCGGGACG CGCTCGGGGC GGCCGGCGGC GCGCTGGCCG CGGCCCGCGA CGGCCTCGTC CCGCTCGGCG CGCCCGCCGT CGACAGCGAG GACGTCCTGG CTGGCTGGAC GGCGCTGACC GACTGGTCCC GCACGCAGGC CACGGCCCGT GACCGCCTGC TCGCGGCGGC CACCGCCGAG GCCGAGGACG CCCGCCGACG GCACGACGAG GCGGACACGG CGCTGGTCGA GCTGCTCGCC GCCCACGACC TGCCGCCTGG CCCCGGTTCC AAGGCCGCCG ACGGTGCGTC CGTCGCAGTG GCGGACGGGC TGGCGGGGGC CCGCGCGGAG CTCGCCCGAG TCGTCGAGCG CCGTGCCGAG GCCGGCCGGC TCACCGCCGA GGTCGCGACG GCGCGCGAGG CGGACCATGT CGCCCGCCAG CTCGGCCAGC TGCTGCGCTC CGACCGGTTC CCGCGCTGGA TGGTGGCCGC CGCGCTGGAC ATCCTGGTCG AGGAGGCGTC GGTGACACTG TCGGAGCTGT CCGGTGGGCA GTTCGAGTTG ACCCACGAGG ATGGCGAGTT CGTGGTCGTC GACCATGCCG ACGCCGACTC GCGCCGGCCG GTGCGGACCC TGTCCGGCGG TGAGACGTTC CAGGCCAGCC TGGCGTTGGC TCTCGCGTTG TCCTCCCAGC TGCGGGCGAT GGCGGCCGGC GGCGCAGCCA CGCTCGACTC GCTGTTCCTC GACGAGGGCT TCGGCACCCT CGACGAGGCC ACCCTGGACG TCGTCGCGAG CACCCTGGAG AGCCTCGCGA GCGGCGGCGG CCGGATGGTC GGCCTGGTCA CCCACGTCCA GGCGCTGGCC GAACGGGTCC CCGTACGCTT CGCGGTGAAC CGGGACCAGC GGACGTCGAC CGTCGTCAGG GAGCGCACTT GA
|
Protein sequence | MRPVQLDLAG FGSFREPAVL DLTDADYFAL VGPTGAGKST LIDALTFALF GSVPRWNNRA TVHLALAPTA ARGTVRLVFD AAGARYVVAR ELRRAARGGV TVRNARFERL VDPAGAGGPG EPTDVLAAGA PAVTAAVEEL LGLTFEHFCT CVVLPQGEFA EFLRAKPSER QAILTRLLGL GVYDTIRAAA NARASDQRQR ADVLAEQLAG YADATAAAEA AAAGRVGALT RLAAEVAAAL AEVARLDTAA AEAAARTARL GAERDQLAAV EPPAGLVDLT GRLRAARAAD ADARRRRQEA ERADSAARGD RAAAQPRAPL DRYARDHREL ADLLATRPAA VTEHAEADAA LARADAELGA ARAALAAATA AERDAEQAAV SAAAELSRLD SELGLLADVR APAGLAALTG RLRGAEAARA EALERREAAE RADTAVREER AAAPSRGPLE RALRDHELLV ELLAGQAAAA AAHADVQAAL GRADAALEEA RHTRHRAAGA RDDAARADLA AALRPHLAVG EDCPVCARPV ATLPPPLPAA GLGTADEAVT AAEREVTARA AEATAATRAA AESEARLADL TRRVEELRRA VADAPDARRA AGLLADVERL DGAVARADLA LRRARDAAAA ADTVVTRSRD GAAGARRALD AVRDPLVSLG APGLAGTDLE ADWATLVTWA AELARDRTGQ RDAAAERSAA ATASLAAARQ ALALASAGQA RADENRIAEA RAQQRAATRV ATLDDRAEAL RVVLASAPTA VEVAAALAEL DRLAATADQA DQRLRAARAE ADDAERAFER ARDALGAAGG ALAAARDGLV PLGAPAVDSE DVLAGWTALT DWSRTQATAR DRLLAAATAE AEDARRRHDE ADTALVELLA AHDLPPGPGS KAADGASVAV ADGLAGARAE LARVVERRAE AGRLTAEVAT AREADHVARQ LGQLLRSDRF PRWMVAAALD ILVEEASVTL SELSGGQFEL THEDGEFVVV DHADADSRRP VRTLSGGETF QASLALALAL SSQLRAMAAG GAATLDSLFL DEGFGTLDEA TLDVVASTLE SLASGGGRMV GLVTHVQALA ERVPVRFAVN RDQRTSTVVR ERT
|
| |