Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_4607 |
Symbol | |
ID | 5672952 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 5489481 |
End bp | 5491262 |
Gene Length | 1782 bp |
Protein Length | 593 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 641243468 |
Product | RNA-directed DNA polymerase |
Protein accession | YP_001508884 |
Protein GI | 158316376 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3344] Retron-type reverse transcriptase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGAGCG CCGCAACGGT GCTGGGTGTC CTGCGTGAAC GCGGCAAACG AGGGCTGCCG TGCGACGAAC TGTACCGGCA ACTGTTCAAC CCACAGATGT ATCTGCTGGC CTACGGGCGT ATCTACACCA ACCATGGTGC GATGACACCC GGAGTCACAC AGGAAACCGT GGACGGCATG TCCCAGGAGA GGATCGGTCG CATCATCGAT GCGATGCGCC ACGAACGCTA CCGATTCCAC CCCGTCCGGC GAGTCCACAT CCCGAAAAAG AATGGGAAGA CCCGCCCGTT GGGGCTACCG ACCTGGTCGG ACAAGCTCGT CGGGGAAGTT GTACGCCTGC TCCTGGAGGC CTACTACGAA CCGACGTTCT CCGACCGGTC CCACGGGTTC CGGCCACGGC GAGGCTGCCA CACCGCTCTG CGGGAGATAG CCAACACCTG GACCGGAACA GCCTGGTTCA TCGAAGGAGA CATCACCGAC TGTTTCGGGT CTCTCAATCA CGACCTGATG ATCGGAATCC TGTCGGAGAG GGTCCGCGAT AACCGGTTCC TGAGGCTGGT GCGTAACATG CTACGAGCCG GATACCTGGA GGACTGGAGG TGGGGGGCGA CTCTGTCGGG GGCTCCCCAA GGGGGCGTGG CCTCGCCGAT CCTGTCCTCG ATCTACCTGC ACAAGCTGGA CGAGTTCGTC GAGAAGATTC TGATTCCGGA GTACACCCGA GGGGGCCGCA GGGCGCGTAA CCCTGCTTAC CTCGTTCTGC AGAACGAGCT GGCCAAGGCA CGCCGACGTG GTGACCGGGG CCAAGCCCGG ATACTACGAC GGCGAATGGT CAGCCTGCCC AGCTCCGATC CCGATGATCC GGGATATCGG CGGCTGCGTT ACTGCAGATA TGCGGATGAC CATCTACTCG GGTTCACCGG ACCGAAGGCC GAAGCCGAGG AAATCAGACA GCGGCTGGCG GAGTTCCTGC GTGACGACCT CAAGTTGGAA CTGTCCGCCG ACAAGACGCT GATAACCCAC GCCCGTACCG GCGCGGCCCG TTTCCTCGGC TACGAGATCA CTGTCCAGCA CAACAGCAGC AAGACCACCC GGCGCCGTCG ATCGGTCAAC GGGCAGGTCG CGCTACGGGT GCCACGCGAT GTGATCAAAG CTAAAAGTGT CCCCTACCTG CACCGCGGAA AACCCGCTAA GCAGAAGGCC CTGACCAACG GTAACGACTA CACCATCGTC GCCACCTACG GGGCCATCTA CCGGGGCATC GTCCAGTACT ACCTGCTGGC CGGAGACGTC CACCGACTTC ACCGGCTGCG CTGGGTCATG GAGACATCCA TGCTCAAGAC CCTGGCAGGC AAGCACCGCT CGTCGGTGTC GAAGATGGCA GCCAAACACA AGGCCAAAAT CCAGACACCG CACGGGCCAC GCACCTGCTT CGAGGCACGC ATCGAACGCG ACAGCAGAAA ACCACTGGTC GCACGGTTCG GTGACATCCC ACTCCGCCGG CAGAAAACAG CGACGGTCTC TGACCGTCAG CCGACCCGGG TGGACTATCC GCACAAGGAA CTCCTCACCA GGCTCCTCGC GGATATCTGC GAAGTCTGCC AGCGCACGGG CAACGTTGAA GTTCACCACG TCCGCGCGCT CAAAGACCTC GCGGCACCCG GCCCTCTGCC GCCCCCGTGG GTCACAGCCA TGGCAAACCG CAGACGCAAG ACACTCGTGG TCTGCGCTAC CTGCCACGGC CAGATCCACA AGGGGCGGCC CGCCACACCG CTCACGCAGT AG
|
Protein sequence | MQSAATVLGV LRERGKRGLP CDELYRQLFN PQMYLLAYGR IYTNHGAMTP GVTQETVDGM SQERIGRIID AMRHERYRFH PVRRVHIPKK NGKTRPLGLP TWSDKLVGEV VRLLLEAYYE PTFSDRSHGF RPRRGCHTAL REIANTWTGT AWFIEGDITD CFGSLNHDLM IGILSERVRD NRFLRLVRNM LRAGYLEDWR WGATLSGAPQ GGVASPILSS IYLHKLDEFV EKILIPEYTR GGRRARNPAY LVLQNELAKA RRRGDRGQAR ILRRRMVSLP SSDPDDPGYR RLRYCRYADD HLLGFTGPKA EAEEIRQRLA EFLRDDLKLE LSADKTLITH ARTGAARFLG YEITVQHNSS KTTRRRRSVN GQVALRVPRD VIKAKSVPYL HRGKPAKQKA LTNGNDYTIV ATYGAIYRGI VQYYLLAGDV HRLHRLRWVM ETSMLKTLAG KHRSSVSKMA AKHKAKIQTP HGPRTCFEAR IERDSRKPLV ARFGDIPLRR QKTATVSDRQ PTRVDYPHKE LLTRLLADIC EVCQRTGNVE VHHVRALKDL AAPGPLPPPW VTAMANRRRK TLVVCATCHG QIHKGRPATP LTQ
|
| |