Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_1225 |
Symbol | |
ID | 3902970 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 1464459 |
End bp | 1467635 |
Gene Length | 3177 bp |
Protein Length | 1058 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 637878558 |
Product | putative exonuclease |
Protein accession | YP_480332 |
Protein GI | 86739932 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0419] ATPase involved in DNA repair |
TIGRFAM ID | [TIGR00618] exonuclease SbcC |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTCCCC ACCACCTGAC CCTGGCGGCG TTCGGCGCCT TCCCCGGCAC CGTCGAGATC GATTTCGACG TCCTCGGCAG CGGCGGCCTG CTCCTGCTCT GCGGGGAGAC CGGGGGCGGC AAGACGACGC TGCTCGACGC GGTCGGGTTC GCGCTGTTCG GCCGGGTGCC GGGGATGCGC GGGGAGGTCT CCGGGCCGCC GGACCTGCGC TCGCACCACG CCGCGGCATC CCTACGGCCG GAGGTGACGC TGGAGTTCAC TGTGGCGGCG GGACGTTTCC GGATCACCCG CGGCCCCGCC TGGGACAAGC CGAAACGCGG CGGCGGGACG ACCCGCGCCC ATCCGACGGC GCGGCTGGAA CGGTTCGACG GCGCCGGCGG CTGGGAGACG GTCGCCACCC GCATGGAGGA CGTCGGCCAT GAGATCGACC TGCTCGTCGG CATGAGCGAC AAGCAGTTCT TCCAGGTCAT CATGCTCCCG CAGGGACGCT TCGCCGACTT CCTCCAAGCC GACCACGGTG CGCGCGAGAA GCTCCTGAAG CGGCTGTTCC ACGTCAACCG CTTCGAATAC GCCGAGCAGT GGCTGCTGGA CCAGGCAAAG ATCGCCGCCG AACGGCTGGC GCTGGCCCGC GCCGAGCTCG ACCGGGTGAC CGCCCGGGTC TCGCAGGTGG CGGCCGTCGA CGAGCCGGAG GATCCGACGG CTGACGCCGG CTGGGCCTCG GACCTCGCCA GGACCGCCGC GGCGGCGGCC ATCTCGGCCG ACGAGGCGGC CGGCGCGGCG GCAGCGCGGC GCACCGCGGC CGAGGAGGCC TTGGACCAGG CACGCGACCT CGCCCGACGC GTCGGGCAGC GCCGGGTACT GGCCGCCCGG CAGGAGGAGC TCGCGGCACA GGCGCCCCGG ATCGAGCTGC TCGCCACCGA GCTGGACGCG GCCCGCCGGG CCGCCGTCGT CGCACCCGCC CTGGGCGAGG TCCACCGCCG CGCCGCCGAG GTCCACCGGG CCGAGGCCGC CGAACGCGAC GCCCGGGACC GGCTCGGACG CCATCCATCC GGGCTCCTTC CCGGGGAACC CGCCCCCTCG GGACCGCTGC CCGAGGAGGG CACCGGGAGG CCTGCCGAGG CACCTGCCGA AGCACCTGCC GAGGAACTCG CCCGCCTGGC CCGCCTCGCC CACACCGAGA CCGGGCGACT GGGAACGTTG GCCGACACCC TCGCCGCCGC CGAGCGGGAC GCCGAGGAGG CCGCCGGGGC CGACCAGGAT ACGGCCGCCT ACACCCGGTC CGCCGCCGAA CTCGCCGAGG CGATCAGCGT CACGCTGCCC CGCGCCCGCG TCGCGGCCGA GGCCCGGGTG GAGGTCGCCC GGCGGGCCGG CGCCGCTCTG CCCGGCCTGG TCGAACGGGC CCGGTGGGCC AAGGAGCTGG CCTCCGCCGT CCGGGAGGGG CGGCAGGTGC GGACCGTGGC CGACGAGGCC GAACGAGACG CATCCGCGGC CCGGACCCAC GCATCCGACC TCCGGCAGCA GCGATTCGAC GCCATCACCG CCGAACTCGC CGCCGCGCTT GTCCATGACA CGCCCTGTCC GGTCTGCGGG GCCCTGGAGC ATCCCGACCC GGCCGAGACC CGGGCCGACC ATGTGAGCAA GGACGCCGAG ACCGCGGCCG GGCAGGAGGC GGACCGGCTC GCCGACGCCG CGACGAGGGC GAGCCGGGCG GTGGCCCACT GGGAGTCCCG GGTCCGGGCG CTGCACGCCG ACCTGGTCGG CCCAACGGAC CCGGACCACA GCGCGAACCC GGACCACAGC GCGAACCCGG ACCACAGCGC GAACCCGGAC CACAGCGCGA ACCCGGACGA CCGCGTCGAG GCGGCCTTCG CCGAGATCCG CGCGCTTCCG GTCGCGACGC TGCTGGGCGG CTCCGGGGCG CCGGTCGCGG ATCGGCTCGA TGAACTCGCC GCCGTGCTGA CCCACGCGGT CCGCGCCCGG ACCCGGACGG CGAAGAAGCT CGCCGCGGCG GAGGCCGCGT TGCGCGAGGT CCACGAGAAC GAGAAGGAGA CCGCGGCCCG CCATTCCGCC GCACGGACCG CGGCGCAGGC GGCCCGGGAA CGCGCGGCGG ACGCCCGGGA CCGCGCCGCC CGACGGCTCG CCGGCGTTCC GGCGGAGCTA TGCGACCCCG ACGCCCTGGC CGCGCGCCGC CGTGCCGTCA CCGCCCTCGC CGCCGATCAC GAGGCCGCGC AGGCCGCGGC CCTGGCTGCC GAGCAGGCCC GGGCCGAACA CATCCGCGCC GGGACGGCCG CCCTCGACCA GGCGCGGCAG GCGGGGTTCT CCGACCTGGA CGACGCCGCC GAGGCCGTGC GGGACTCCGA CTGGATGCGC CGCGCCGCGG ACGAGGCCCG GGCCCATCGG GACGAGCTCG TCGCCGTCGG GGCCAGGCTG GCGGGCGAGG ATCTCGCCGT CGATCCGGAC ACCGAGGTCC CGCTGGCCGA CCACGAGACG GCCGTGACCG ACGCCCGCGA GGTCCACGAG AGCGCGCTCG CCACGGCCGC ACGCGCCCGG GAACGGGCCG AACGGCTGGC CTCGCTGGTC ACGGAGTTCA CCGAGAAGCT CACTACCCTG GATCCGCTGC GGGAGGCGGC GGACGAGCTG CGGGGCCTCG CCGATCTGGC CGCCGGGCGG GGTGCCAACA CCGAGCGCAT GCCGCTGTCC AGCTTCGTGC TGGCGGCTCG GTTGGAAGAG GTGGCCGCCG CCGCCAGCCA TCGGCTCGCG GCGATGAGCA GCGGTCGGTT CACCCTGGTG CACGACGCGG GGGAAAGCCG CGACAAGCGT CGGCGCGCCG GCCTCGGGCT GCTGGTCGAC GACGCCTGGA CCGGCCGGCG GCGCGACACC GCCACCCTGT CCGGCGGGGA GACCTTCCAG GCGGCGCTGT CACTGGCCCT CGGGCTCGCC GACGTCGTGA CGGCCGAGGC GGGAGGCCGG CGGATGGACG CGCTGTTCAT CGACGAGGGT TTCGGCACGC TCGACCCGGA CAGCCTCGAC GAGGTGATGA CCGTTCTCGA CGAACTGCGT TCCGGCGGCC GGCTCGTCGG CGTCGTCAGC CATGTCACCG AGCTGCGCCA GCGCATCCCG AACCAAATCC GGGTCGTCAA AGGGGTCGGC GGCAGCCGGG TCGAGACCAC GTCCTGA
|
Protein sequence | MRPHHLTLAA FGAFPGTVEI DFDVLGSGGL LLLCGETGGG KTTLLDAVGF ALFGRVPGMR GEVSGPPDLR SHHAAASLRP EVTLEFTVAA GRFRITRGPA WDKPKRGGGT TRAHPTARLE RFDGAGGWET VATRMEDVGH EIDLLVGMSD KQFFQVIMLP QGRFADFLQA DHGAREKLLK RLFHVNRFEY AEQWLLDQAK IAAERLALAR AELDRVTARV SQVAAVDEPE DPTADAGWAS DLARTAAAAA ISADEAAGAA AARRTAAEEA LDQARDLARR VGQRRVLAAR QEELAAQAPR IELLATELDA ARRAAVVAPA LGEVHRRAAE VHRAEAAERD ARDRLGRHPS GLLPGEPAPS GPLPEEGTGR PAEAPAEAPA EELARLARLA HTETGRLGTL ADTLAAAERD AEEAAGADQD TAAYTRSAAE LAEAISVTLP RARVAAEARV EVARRAGAAL PGLVERARWA KELASAVREG RQVRTVADEA ERDASAARTH ASDLRQQRFD AITAELAAAL VHDTPCPVCG ALEHPDPAET RADHVSKDAE TAAGQEADRL ADAATRASRA VAHWESRVRA LHADLVGPTD PDHSANPDHS ANPDHSANPD HSANPDDRVE AAFAEIRALP VATLLGGSGA PVADRLDELA AVLTHAVRAR TRTAKKLAAA EAALREVHEN EKETAARHSA ARTAAQAARE RAADARDRAA RRLAGVPAEL CDPDALAARR RAVTALAADH EAAQAAALAA EQARAEHIRA GTAALDQARQ AGFSDLDDAA EAVRDSDWMR RAADEARAHR DELVAVGARL AGEDLAVDPD TEVPLADHET AVTDAREVHE SALATAARAR ERAERLASLV TEFTEKLTTL DPLREAADEL RGLADLAAGR GANTERMPLS SFVLAARLEE VAAAASHRLA AMSSGRFTLV HDAGESRDKR RRAGLGLLVD DAWTGRRRDT ATLSGGETFQ AALSLALGLA DVVTAEAGGR RMDALFIDEG FGTLDPDSLD EVMTVLDELR SGGRLVGVVS HVTELRQRIP NQIRVVKGVG GSRVETTS
|
| |