Gene Information |
Locus tag | EcolC_2197 |
Symbol | |
ID | 6064974 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | - |
Start bp | 2411676 |
End bp | 2412998 |
Gene Length | 1323 bp |
Protein Length | 440 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 641601604 |
Product | RNA-directed DNA polymerase (Reverse transcriptase) |
Protein accession | YP_001725163 |
Protein GI | 170020209 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3344] Retron-type reverse transcriptase |
TIGRFAM ID | |
|
|
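The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the Start bp and End bp values are 1-based and inclusive (an assumption, though it matches the listed 1323 bp), and that the protein length excludes the stop codon. The function names are hypothetical and not part of any database API.

```python
# Coordinates copied from the record above.
# Assumption: 1-based, inclusive positions on the replicon.
START_BP = 2411676
END_BP = 2412998


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))
```

With the values from this record, the sketch reproduces the listed Gene Length (1323 bp) and Protein Length (440 aa).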
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.298068 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGTGG AGCGAAGGGG CAGCGTCGGG CAGTCTGCAT CACAGCCCAA CTGTAAACAG GAGGAGGCTG CGGGTGAACA GACAAAACCG TTTCAGGTCA GTAAATTGCA TGTGGTGGAA GCGTACCGGC GGGTAAAGGC GAATGCCGGA GCTGCCGGAG TGGATAACCA GACGCTGAAA GATTTTGAGC GGGATCTGAA AGGAAATCTG TACAAAATCT GGAACCGGTT ATCGTCGGGA AGCTGGATGC CGCCGCCAGT GCGTGCGGTG GAAATTCCGA AGAAGGATGG CAGTAAAAGA CTTTTAGGAA TACCGACAGT AAGCGATCGA ATAGCGCAGA TGACGGTGCT GGTAACGTTT GAACCATTAG TGGAGCGTTA TTTCCTGAAT GATTCATACG GATACCGGCA TGGAAAATCA GCGCTGGATG CAATAGCAGT CACCAGAAAA CGTTGCTGGC AATACGACTG GTATCTTGAG TTTGATATTA AAGGTTTATT CGATAATATC CCCCACGATT TATTACTCAG GGCAGTGGAT AAACATTGTG CGGATAAATG GGTAAGACTG TCTATTCGCA GGTGGCTGAC AGCGCCGGTG CAGATGCCGG ATGGAACACT AAAGGAAAGA AATAAAGGTA CGCCACAGGG TGGAGTCATC AGTCCGGTAC TGGCGAATTT ATTTCTGCAC TATGTGTTTG ATAAATGGCT GTCGTTACTT TATCCGGAAA TTCCCTGGTG TCGTTATGCA GATGATGGAT TAATTCATTG TGGCAGTAAA CAACAAGCAG AGGAATTACT GAATAAACTG GCAAAACCTT TTCAGGAATG TGGACTGGAA TTACATCCGG AGAAAACGAA AATAGTTTAC TGCAAAGACA GCGAACGACA GGCGAACCAC GAAACAGTGC AGTTTAATTT TCTGGGGTAT ACGTTCAGAG CGCGAAGAGC ACGTAATCAG CGAAGGGGAA ATCTCTTTAC GAGTTTCAGT CTGGCGGTGA GTAACAGTGC GCAGAAAGAC ATGATCGGAA AACTCAGGAA ACTGCGGCTC AGACGCAGGG TGGAAATGAG TCTTGAAGAT ATTGCGAAAA GGCTGAATCC GATGATATCG GGGTGGCTGA ATTATTACGC GAAATACTAC AAGTCAGCGA TGAAGAAAGT ATGCAGATAT ATTAATCTGA CGCTGATTGC GTGGGCGAGA AAGAAATACA AGACCCTGCG GTATAAAAAG ACGAAGGCGT GTCAGTTAAT GGAAAGACTG TCGAAAGAGA AGCTGGAGCT ATTTGCCCAC TGGAAAGCAG GACCAGGAAG CGCGTTTGCC TGA
|
Protein sequence | MNVERRGSVG QSASQPNCKQ EEAAGEQTKP FQVSKLHVVE AYRRVKANAG AAGVDNQTLK DFERDLKGNL YKIWNRLSSG SWMPPPVRAV EIPKKDGSKR LLGIPTVSDR IAQMTVLVTF EPLVERYFLN DSYGYRHGKS ALDAIAVTRK RCWQYDWYLE FDIKGLFDNI PHDLLLRAVD KHCADKWVRL SIRRWLTAPV QMPDGTLKER NKGTPQGGVI SPVLANLFLH YVFDKWLSLL YPEIPWCRYA DDGLIHCGSK QQAEELLNKL AKPFQECGLE LHPEKTKIVY CKDSERQANH ETVQFNFLGY TFRARRARNQ RRGNLFTSFS LAVSNSAQKD MIGKLRKLRL RRRVEMSLED IAKRLNPMIS GWLNYYAKYY KSAMKKVCRY INLTLIAWAR KKYKTLRYKK TKACQLMERL SKEKLELFAH WKAGPGSAFA
|
| |
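The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. The gene sequence as listed already reads in the coding direction (it begins with ATG and ends with TGA), so no reverse complement is applied even though the gene lies on the minus strand. The amino acid assignments of translation table 11 are the same as those of the standard code; the table differs in which codons may act as start codons, which does not matter here because the gene starts with ATG. The helper names below are hypothetical, and only the first three and the last in-frame blocks of the gene sequence are used to keep the example short.

```python
# Codon table built from the compact NCBI-style layout (base order T, C, A, G).
# The amino acid assignments are shared by translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the record.
GENE_PREFIX = "ATGAACGTGG AGCGAAGGGG CAGCGTCGGG"
# Last 21 bases of the gene sequence (in frame), ending with the TGA stop.
GENE_SUFFIX = "CCAGGAAGCGCGTTTGCCTGA"

print(translate(GENE_PREFIX))
print(translate(GENE_SUFFIX))
```

The two printed fragments correspond to the first ten residues (MNVERRGSVG) and the last six residues (PGSAFA) of the protein sequence listed in the record. For whole-genome work an established library would normally be preferable to a hand-built table.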