Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_5583 |
Symbol | |
ID | 5673911 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 6767268 |
End bp | 6769892 |
Gene Length | 2625 bp |
Protein Length | 874 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 641244437 |
Product | putative helicase |
Protein accession | YP_001509841 |
Protein GI | 158317333 |
COG category | [R] General function prediction only |
COG ID | [COG3973] Superfamily I DNA and RNA helicases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCCGACGA CACGTGACGG AGAGCTCGCC CGTGAGCAGG CCTACGTCGA CACCCTCTAC GGCCGGTTGG ACGAGGTCCG GGAGACCACC AAAAGGCAAC TCCGACAGGT GCTGCTCGAG GCTGGCACCG GCACGCCGCA GTCGATCGTG GAACGCGACG TGTTCGCCGC GACGCATGCC GACCGGCTGG CCCGGCTCGA CGCCGCCGAG GGGCGGCTGT GCTTCGGCGC GATGGACCAC GCGGCCGGGG GGCGCACCTA CATCGGCCGG ATCGGGCTGT CCGACCAGGA GCAGGAGCCG ATCCTGGTCG ACTGGCGGGC GCCCGTGGCC ACCGCGTTCT ACCAGGCGAC CATCGCCGAT CCGCGTGGCC TGACCCGCCG CCGCCACCTG CGCACCCGCG GCCGGCGGGT CACCGGCCTC GCCGACGACC CGCTCGACCC CCGCGCCTAT CTCGCGCAGG CCGGCGCCGC CGACGGGGCT TCGGGGGCCG CCGAGGGCGA TGCGGCCCCC GAAGCCGGCG CGGGCGCCGA AGCCGGGTTC GGCGCGACCG GGGACACGAT GCTCCTCGAG GCGCTGTCCG CCCCGCGCAC GGGCCGGATG CACGACATCG TCTCGACGCT GCAGGCCGAA CAGGACAGGA TCATCCGGGC GGCGGCGAAC GCGGTGCTGG TGGTCGACGG TGGGCCGGGC ACCGGCAAGA CCGCCGTCGC GCTGCACCGT GCCGCATATC TCCTCTACAC CGATCGCGAC CGGCTCGTCC GGTCCGGGGT GCTCGTGGTC GGCCCCAGCC CCGTCTTCCT CCGCTACATC GAGCAGGTCC TGCCCTCCCT CGGCGAGACC GGAGTGGTGT TCGCGACCCC GGGACGGCTC TTCCCCGGCG TGGACGCGAC CGGCGAGGAC CCGGTCGCCG CCGCGTCCCT CAAGGGCGAC GCACGGATGG CCGACGTCAT CGCCGGCGCG GTCCGGGACC GCCAGCGGGC ACCCGGCCGA GGGGTGCGGA TCCGCCACGA CGAGCACGAC CTGCACCTGG ACCGCGACAC CATCGTCCGG GCCAGGACTC GGGCCCGCCG CAGCCGCCGC CCGCACAACT CCGCGCGCCG CGTGTTCATC CGCGAGCTGC TCGGCGCGTT GACGAACCAG GTGGTCTCCC GGTTGCCCGG TGGCCTCTTC GAGCCCGAGG AGCGCTCGGA GATCACTTCA GATCTGTGGG CGGACCCCGG GGTGCGGCGG GCCCTGAACG ACCTGTGGCC GCTGCTCACC CCGGCCCGCC TGCTGGCCGA CCTGTACGCC TCGCAGGAGC TCCTCGCCCG GGCCGCCGGT ACCCGGCTCA CCGCCGAGGA ACGCGCCCTG CTGCGGCGCG AGGGCCCGGC GGACGCGCCG GCGCGGCGCT CCGGCGAAAC CCGCTCCGCC GAAACCGGCC GGATCCGCTG GACTCCGGCG GACGTCGCAC TGCTGGACGA GGCCGCGGAG CTGCTCGGCG ACCCGGAGGA GGACGCCCGC CTCGACGAGG CGCGCCGCGC CGCCGCCGAG CGCGCGGCCG AGCGGGAGTA CGCCCGCGGC GTGCTGGAGA TGCTCGGCCT CGACGACCGG CTCGACGCCG ACACGGTCGC CGAGCGCTGG ACCGCACCCC GCCAGCGCCG CGGCGCCGCC GAGTACGCGA CCGGGGACCG GACCTGGACG TTCGGCCACC TCATCGTCGA CGAGGCCCAG GAGGTCTCGC CGATGCTGTG GCGCCTGCTT TGGCGGCGCT GCCCGGGGCG CACCGCGACG CTCGTCGGGG ATCTCGCCCA GGCGGCCCGC CCGCGGCCGC CGGAGAGCTG GGGCGAACTG CTGGGTCCGG CTGTCGGTGC CCACTACACC GTCGAGCGGC TCACCGTGAA CTACCGGACC CCCAGCGAGA TCATGGATGT CGCGGCCGAC GTGCTGACGG CGGCCGATCC CACGGCCGCG CCGCCCCTGT CGGTCCGCTC CGCCGGCCGG CGCCCGGACG CCGTCCGGAT ACCATCCGCC GACGAGGCGC TCCTGCGGGG CGTGGTCGAC GAGTCCGTGC GGGCCGCGGG CGAGGCGGCC GGGGGGCGGG TCGCCGTGAT CTGCCCTCCG GGGCGGACCG CGGCGATCCG GGCCGCCCTG CGCGCTGCCG TGCCCCAGCT CGCGCTGCCG AGGCAGCCCG ACGACCCGGA CGACCGCCCG ACCGGGACCC GGGAGCCGGC GGAGGCGGAC CTGCTGGACG CGCCCGTCGC CGTCCTCACC GTCGCGGAGT CCAAGGGCCT CGAGTTCGAC GCCGTGGTCC TCGTCGAACC GGCCGAGATC CTGGCCGGCC CGACCCGGGG CCTGGCGGAT CTCTACGTGG CGCTGACCCG GGCCACCCGC GTGCTCACCG TGGTGCACAC CGGGGAGCTC CCGGGCGTCC TGCATCGGAT GCCGGTGCGC GGTGGCCCGG CGACACCCGA GGCCGGCGCC GCCCCGAGGG CGACCAGTCG CACCGAAGCC GAGGCCCGCA CCGCCCCCGC CCACACCGAC ATCGATGACA CCGACACCGA TGACACCGAC ACCGGCGATG CCGACACCGA TGCCGGTCTC GGTGACCCGG TGGTCGGAGG CCGCCAGCTC AACCTGCTGT GGTGA
|
Protein sequence | MPTTRDGELA REQAYVDTLY GRLDEVRETT KRQLRQVLLE AGTGTPQSIV ERDVFAATHA DRLARLDAAE GRLCFGAMDH AAGGRTYIGR IGLSDQEQEP ILVDWRAPVA TAFYQATIAD PRGLTRRRHL RTRGRRVTGL ADDPLDPRAY LAQAGAADGA SGAAEGDAAP EAGAGAEAGF GATGDTMLLE ALSAPRTGRM HDIVSTLQAE QDRIIRAAAN AVLVVDGGPG TGKTAVALHR AAYLLYTDRD RLVRSGVLVV GPSPVFLRYI EQVLPSLGET GVVFATPGRL FPGVDATGED PVAAASLKGD ARMADVIAGA VRDRQRAPGR GVRIRHDEHD LHLDRDTIVR ARTRARRSRR PHNSARRVFI RELLGALTNQ VVSRLPGGLF EPEERSEITS DLWADPGVRR ALNDLWPLLT PARLLADLYA SQELLARAAG TRLTAEERAL LRREGPADAP ARRSGETRSA ETGRIRWTPA DVALLDEAAE LLGDPEEDAR LDEARRAAAE RAAEREYARG VLEMLGLDDR LDADTVAERW TAPRQRRGAA EYATGDRTWT FGHLIVDEAQ EVSPMLWRLL WRRCPGRTAT LVGDLAQAAR PRPPESWGEL LGPAVGAHYT VERLTVNYRT PSEIMDVAAD VLTAADPTAA PPLSVRSAGR RPDAVRIPSA DEALLRGVVD ESVRAAGEAA GGRVAVICPP GRTAAIRAAL RAAVPQLALP RQPDDPDDRP TGTREPAEAD LLDAPVAVLT VAESKGLEFD AVVLVEPAEI LAGPTRGLAD LYVALTRATR VLTVVHTGEL PGVLHRMPVR GGPATPEAGA APRATSRTEA EARTAPAHTD IDDTDTDDTD TGDADTDAGL GDPVVGGRQL NLLW
|
| |