Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_1671 |
Symbol | |
ID | 5670073 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 1997602 |
End bp | 1999305 |
Gene Length | 1704 bp |
Protein Length | 567 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641240589 |
Product | HNH endonuclease |
Protein accession | YP_001506015 |
Protein GI | 158313507 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.953855 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCTCCC TTCCCGGCTC GAGGCGTGGC GCATCGCCAC GCGGGCAACG CACCGCCGCC AACGCGCCCG ACGCCAAGGT CGGATCCGCC GCCACATCCA CCGGATCAGA CGGCCGACCC ACACCAGGGG ATGAGGCCGC CACCGTGGTC GGGAGTCCCC CGACGCACTC GACCAGTTCA GGAGAGGACT CGTGGGCAAC CAGCGGCGAC ACCCGCACCC GCGAGGCGCT CGACACGAAA GCTGACGATC TCTTCAGGGA GATCTCTCGC GTCGTCCGCA CGATCGCGGC CGGCCACGCG GATCTACTGA CCCTGCTGGC TCAGTTCGCG GCCCTCCGGC TCCCGCCAAG CGCCGAGCAA TCACTCGTCT CCGACGAGTT CGCACCCGAA GAGATAGCCA CCGCGATGGC GGTCTCGCCC CAGGCGGCCT CCGCCCAGCT GACGTTCGCC TGCACCGTCA CCCGGCGCCT TCCGAACGCA GTCGACGCGC TCAGGGCGGG AGTCCTGGAT CTCCAACGAC TGCAGTCACT GGAGAAAGCG GTCCACCCGC TCGACGACCG CACCGCAGCC AGAGTCGAAG CACACGTGCT CGCCGGAGGA GCACGACCGA ACCGGCGCGC CTTCACCGAT GCCTGCAGGC GCGCCGTAGC CAGGATCGAC CCCGAAGGCG CGGCAGACCG CGCGGAGACC CGGCGCAAGG ACCGACGGGT GTGGGTCTCT CCCGACGAGG ACGGGACAAG CGCGCTCACA GCCGTCCTTC CCACGGAGGA CGCCGAGGCC TGCTACCAGC GGATCGATCA GACTGCTCGT TCCATCAGTG CCAACAGATC CGACGGTGAC GACCGGACCC GCGCCCAGAT TCGTGCCGAC GTGCTGGTCG ATCTACTCAC GGGCCGAGGC AGCCACGCCA CGCCGATGCC GTGCGACATT CAAGTCGTCG TACCCGCGAC GACCCTGATG GGCCCGTCCA ACGACCCCGG CGAGATTCCG GGGTACGGAC CGATCCCCGC ACGAATAGCC CGGGAGATGG CCAAACGGCC TGGCTCAACC TGGCGGCGGA TTCTGACCGA TCAACAAGGA CATCTGATCG AGGTCGCCGA CCGGCGTCTA CCAAGCGCCG CACAGGTCCG CCATGTTCGT GCCCGCAACC GGACCTGTGT ATTCACAGGC TGCACCCGGC CATCACGGCG CACGGATACC GATCACACCA TCTCCTTCTC CGAGGATGGA CGGACTCTGA CCAAGAATCT CGGTCCGCTG TGCCGTCGAC ACCACCGGAT GAAACATCGT GGCCACTGGC AGGTCACACA ACCCCAGGAA GGAACCTTCG TGTGGACCAG CCCACTCGGG CGAACCAGTG TCACCACCCC TGACACATAT CTCGAAGTCG ATGTCTCACC CGGGACTACG ACCAAGAATC GGCACGCCGA GCACTTGAGG GAAGATCACG GGTACACGCG ATCCCGCCCG AAAACGCCGA CCGGGGCGGG AAAGGATCCT GACTGGCAGA TGCCATCCCA GACAAGGCCT GACGTGGCCC GGCGTGAGTC ATCTGCTTGG CGTCAGGCAG GATCGGGGCG GGACCTGCGC CGGAGGGAAC CGGGCGGCAG AGAAATCGAG GGAGTCAACC AACGTGAGGA ACACAACCGC GCCGCCGGAG CTCGCGAGCC GAGTCGGATC CGTCCAGACC TCTCCGGTAC GTGA
|
Protein sequence | MTSLPGSRRG ASPRGQRTAA NAPDAKVGSA ATSTGSDGRP TPGDEAATVV GSPPTHSTSS GEDSWATSGD TRTREALDTK ADDLFREISR VVRTIAAGHA DLLTLLAQFA ALRLPPSAEQ SLVSDEFAPE EIATAMAVSP QAASAQLTFA CTVTRRLPNA VDALRAGVLD LQRLQSLEKA VHPLDDRTAA RVEAHVLAGG ARPNRRAFTD ACRRAVARID PEGAADRAET RRKDRRVWVS PDEDGTSALT AVLPTEDAEA CYQRIDQTAR SISANRSDGD DRTRAQIRAD VLVDLLTGRG SHATPMPCDI QVVVPATTLM GPSNDPGEIP GYGPIPARIA REMAKRPGST WRRILTDQQG HLIEVADRRL PSAAQVRHVR ARNRTCVFTG CTRPSRRTDT DHTISFSEDG RTLTKNLGPL CRRHHRMKHR GHWQVTQPQE GTFVWTSPLG RTSVTTPDTY LEVDVSPGTT TKNRHAEHLR EDHGYTRSRP KTPTGAGKDP DWQMPSQTRP DVARRESSAW RQAGSGRDLR RREPGGREIE GVNQREEHNR AAGAREPSRI RPDLSGT
|
| |