Gene Information |
Locus tag | Franean1_1813 |
Symbol | |
ID | 5670215 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 2176173 |
End bp | 2177984 |
Gene Length | 1812 bp |
Protein Length | 603 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 641240734 |
Product | hypothetical protein |
Protein accession | YP_001506157 |
Protein GI | 158313649 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0322] Nuclease subunit of the excinuclease complex [COG2176] DNA polymerase III, alpha subunit (gram-positive type) |
TIGRFAM ID | [TIGR00573] exonuclease, DNA polymerase III, epsilon subunit family |
|
|
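The coordinate and length fields above are consistent with each other, and the arithmetic can be sketched as follows. This is a minimal check, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon; the function names `gene_length` and `coding_codons` are hypothetical and are not part of the record.

```python
# Values copied from the Gene Information table above.
START_BP = 2176173
END_BP = 2177984


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Number of amino-acid codons, assuming exactly one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                 # 1812, matching "Gene Length"
print(coding_codons(length_bp))  # 603, matching "Protein Length"
```

Under these assumptions, 1812 bp corresponds to 604 codons, one of which is the stop codon, leaving the 603 residues reported in the table.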
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0953081 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0279118 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTGCCCAG TCCCTGCGCC GACCGGCGGC GTGCCGCGCC AGGCCAGCCT GGCAGAGCTC GGCCGGCCGC TGTCGGACCT GACGTTCGTC GTCGTCGACC TGGAGACCAC CGGCGGCTCG CCGGCGACGA GCGAGATCAC CGAGATCGGC GCCGTGCGGG TTCGGGGCGG CCAGATCCTC GGTGAGATGT CGAGCCTGGT CCGGCCGTCG GCCCCCATCC CGGCCTTCAT CTCCGTGCTC ACCGGCATCA CCGGCGCCAT GGTCGCCACC GCCCCGGGCA TCGGCGAGGT CGTGCCGACG TTCCTGGAGT TCGCCCGGGG CGCGGTGCTC GTGGCTCACA ACGCCCCGTT CGACCTCGGC TTCCTGCGCG CCGCCGCCAC GGCGTGCGGC TACCCGGCCC CGGCCTGGGA ACATCTCGAC ACGGTGCGGA TCGCGCGCCG CGTCATCAGC CGCGACGAGA CCCGCGACTG CCGCCTGTCC TCCCTGGCGG CCCTCTTCGG TAGCGCGACC CAGCCGAACC ACCGGGCGCT GGCCGATGCC CGCGCGACGG TCGACGTCCT GCACGGGCTG TTCGAGCGGC TGGGCAACCT GGGTGTCACC ACGATCGAGG ATCTGCACGA GTACAGCGCG CGGGTCTCCC CCGCCCAGCG GCGCAAGCGG CACCTCGCCG ACGACCTGCC CACCGGCCCG GGGGTCTACG TGTTCCGCGA CGGCACGGGG CGCCCGCTGT ACGTCGGCAC GTCCCGGTCG GTCCGCTCCC GGGTGCGTAC CTACTTCACG GCCAGCGAGC CCCGCACCCG GATGGCGGAG ATGGTCGCGA TCGCCGAGCG GGTCGACGCG ATCGAGTGCG CGCACGCGCT CGAGGCGGAG GTGCGCGAGC TGCGGCTGAT CGCCGAGTAC AAACCGCCGT ACAACCGCCG CTCCCGGTTC CCCGAGCGGG CCGTCTACCT GCGGCTCACC GACGAGCCGT TCCCCCGGCT CTCCCGGGTC CGCTCCGTCG GCGACGGTGT GACGTCGCTG GGGCCGTTCG GCAGCGCGGC GGCGGCCGAG TCGGCGGCCA CGGCGCTGCT GGAGGCGATC CCGCTGCGCC AGTGCTCGAC CCGGCTCTCG CCGCGCCGTC CGACGGCCGC GTGCGCGCTG GCCGAGCTGG GGCGCTGCGG CGCGCCGTGC GACGGCCGGG AGGGGGTCGC CGAGTACGGC CAACACGTCG CCACGGCGCG CGGCGCGATG ACCGCCGATC CGCGTCCCGT CGTGGACGTG CTGGAGCGGC GCATCGCGCG GCTGTCCGCC GACCAGCGCT ATGAGGAGGC CGCCGGGGTC CGCGACCGGC TCGCGGCCTA CGTGCGGGCC GTCGCGCGCA TGCAGCGGCT GACGGCGCTG ACCTGCATCG ACGAGTTGGT CGCCGCCGCG CCGACCGCCG ACGCCGGGTG GGATCTCGCC GTCGTCCGCC GTGGCCGGCT GGTGTCCGCG GCGTCGGTGC CGCGCGGCAC CGACCCGAGG CCCTGGGTCG ACGCCGTGGT CGCCAGCGCG GAGACCGTCC GACCACTGCC CGGCCCCACC CCGTGCGCCT CGGTCGAGGA GACGGAACGG ATCGGGCGGT GGCTGGCCGG GCCCGGCGTG CGCCTGGTCC GGCTGGACGG CGAGTGGAGC TGGCCGGCGC ACGGCGCGAT CCGCGCGGCG CGGCGGTTCG ACGTCCGCTT CGACGGCGGT TTGGACGGCG GGTTCGACCG CGGGTTCGAC TCCCCCACCG ACACCCGACG CGGGCGCGCG CCCAGCAACC CCCGGAGCGG CCGCGAGCCG 
CGGAAACGCT AG
|
Protein sequence | MCPVPAPTGG VPRQASLAEL GRPLSDLTFV VVDLETTGGS PATSEITEIG AVRVRGGQIL GEMSSLVRPS APIPAFISVL TGITGAMVAT APGIGEVVPT FLEFARGAVL VAHNAPFDLG FLRAAATACG YPAPAWEHLD TVRIARRVIS RDETRDCRLS SLAALFGSAT QPNHRALADA RATVDVLHGL FERLGNLGVT TIEDLHEYSA RVSPAQRRKR HLADDLPTGP GVYVFRDGTG RPLYVGTSRS VRSRVRTYFT ASEPRTRMAE MVAIAERVDA IECAHALEAE VRELRLIAEY KPPYNRRSRF PERAVYLRLT DEPFPRLSRV RSVGDGVTSL GPFGSAAAAE SAATALLEAI PLRQCSTRLS PRRPTAACAL AELGRCGAPC DGREGVAEYG QHVATARGAM TADPRPVVDV LERRIARLSA DQRYEEAAGV RDRLAAYVRA VARMQRLTAL TCIDELVAAA PTADAGWDLA VVRRGRLVSA ASVPRGTDPR PWVDAVVASA ETVRPLPGPT PCASVEETER IGRWLAGPGV RLVRLDGEWS WPAHGAIRAA RRFDVRFDGG LDGGFDRGFD SPTDTRRGRA PSNPRSGREP RKR
|
| |
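The GC content field can be recomputed from the gene sequence shown above. The sketch below is a hedged illustration: `gc_percent` is a hypothetical helper, it assumes the sequence is pasted as whitespace-separated blocks of bases as displayed on this page, and it is demonstrated only on the first two 10-base blocks rather than on the full gene.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases, ignoring whitespace between blocks."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc_count / len(cleaned)


# First two blocks of the gene sequence from the record above.
first_blocks = "GTGTGCCCAG TCCCTGCGCC"
print(gc_percent(first_blocks))  # 75.0 (15 of 20 bases are G or C)
```

Passing the complete gene sequence to the same function would give the value to compare against the reported GC content.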