Gene Information |
Locus tag | Tery_0240 |
Symbol | |
ID | 4242395 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | - |
Start bp | 375832 |
End bp | 377736 |
Gene Length | 1905 bp |
Protein Length | 634 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 638105584 |
Product | RNA-directed DNA polymerase |
Protein accession | YP_720200 |
Protein GI | 113474139 |
COG category | [L] Replication, recombination and repair; [V] Defense mechanisms |
COG ID | [COG1403] Restriction endonuclease; [COG3344] Retron-type reverse transcriptase |
TIGRFAM ID | |
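
The length fields in the table above can be cross-checked against the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 375832
END_BP = 377736


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(n_bases: int) -> int:
    """Residue count for a CDS, assuming one trailing stop codon."""
    if n_bases % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return n_bases // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1905 634
```

Under these assumptions both results agree with the reported Gene Length (1905 bp) and Protein Length (634 aa).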
| |
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.816988 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.0174648 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACAAAG CTAAAACCCT AAAAAGAAAT TTAGAAAATC CTAAACCATT TTTAAGCGTA GCATGGGATA CAAACGATAT ACCGGACCTA TGTAGTGTCA ATCCTAATCT TAAATGGAAG GACATCAACT GGAAGAAGGT AGAAAAATAT GTATTTAAGT TGCAAAAGTT AATTTACAGA GCATCCAGCC GTGGCGAAAT CCGCAAAATG CGTAAATACC AAAAACTTCT GACCAAAAGT TATTATGCAA GGTTGCTAGC TGTTAGGCGT GTGACTCAGG ACAACCAGGG AAAGAAAACT GCTGGTATAG ATGGTATAAA AAGCCTTCCC CCAATGCAGA GGTTGAACCT GGTAGAAATG TTAAAGTCAC AATATCTTAA AGCAAGCCCA ACCCGTAGAG TCTGGATACC AAAACCAGGT AGAGAAGAAA AACGCCCATT AGGCATACCT ACTATGTATG ATAGGGCACT TCAAGCACTG GTAAAGTTAG GTAGGTCACC AGAATGGGAA GCACTTTTTG AGCCTAACAG TTATGGTCTT TTCCGGAGGT CAGCACATGA TGCTATTGCA GCAATCTATG TCAGTATTAA CCAAAAACCA AAATATGTAT TAGATGCTGA CATATCCAAA TGTTTCGACC GAATTAACCA TGATGCACTA TTGAGAAAAA TAGGGCGAAC ACCTTACAGA CGATTAATCA AACAATGGTT AAAATCTGGA GTATTCGACA ATAAACAATT CTCAAACACT GTGGAAGGTA CACCACAGGG AGGGGTTATC TCACCCTTAC TAGCAAACAT CGCCTTACAC GGTATGGAAA AATGCTTAGA AGATTACGCC GAAACTCTTT CAGGGAGTAA ACGTGATAAT AAACATGCAT TGTCCCTAAT ACGTTACGCT GATGACTTTG TAATCCTACA CAAAGACATC AAAATATTGT TACAAGCAAA AACCGTAATA CAGGAATGGT TAAACCAGGT AGGATTAGAA CTAAAACCAG AAAAAACCAG AATTGCCCAC ACTTTGGAAG AATATGAAGG TAACAAACCT GGATTTGACT TCCTAGGATT CACAATAAGG CAATGGAAAT TCAAGACAAC CAAACAAGGA TTTAAGACAC TGATAAAACC GTCCTCTAAG AGTATAAAAA CTCATTATCG GAGGCTAGCG GATATATGTG ATAACCACAA AACTGCTCCT AAAAAAGCTT TAATAGCTAA ACTTAATCCG ATAATTAGAG GATGGGCCAA CTACTTTTCC ACTGTAGTCA GTAAAGAAAC CTTTTCAAAG TTAGACCACC TAGTTTGGAA AAGGATAGGT CGATGGGCAA GTAGACGGCA TCCAAACAAG TCAGCCAAAT GGGTCAAGAG TAAGTATTTC CCTCGCTGCA AAGTCACCAG AAACTGGTTA CTTAACGACG ACGAATATAT ACTTAACCGG CACTCAGACG TTGCTATAGT AAGACACGTC AAGGTAAAAG GTAATAAATC CCCATTAGAC GGTGATTGGA CTTATTGGAG CAGTAGAATT GGTAAACATC CAGGCATAAG GAAAGAAGTC ACAACGCTGT TAAAGCGACA AAAGAACAAA TGCGCATTTT GTGGACTAAC TTTCAGATCA AACGACCTCA TGGAAATTGA CCATATAAAA CCAATATCTG AAGGCGGTGA TAACACAGTT AAAAATAAAC AACTGTTACA CCTACATTGC CACGATACTA AAACTGCTTT AGATAATATA ACATACACAA GACCCAAGTT ACAGGACTTA CCTGATAAAT ACCTGTGGGT GAATGATATG 
TTAATTCTAA AACAAGGATG TACCTATGAA AAAGGACGTT TAGGAGAGGA GCCGGATGAG GTGAAAGTCT CACGTCCGGT TTTGAAGACG AGTCGGGTAA GGTAA
|
Protein sequence | MNKAKTLKRN LENPKPFLSV AWDTNDIPDL CSVNPNLKWK DINWKKVEKY VFKLQKLIYR ASSRGEIRKM RKYQKLLTKS YYARLLAVRR VTQDNQGKKT AGIDGIKSLP PMQRLNLVEM LKSQYLKASP TRRVWIPKPG REEKRPLGIP TMYDRALQAL VKLGRSPEWE ALFEPNSYGL FRRSAHDAIA AIYVSINQKP KYVLDADISK CFDRINHDAL LRKIGRTPYR RLIKQWLKSG VFDNKQFSNT VEGTPQGGVI SPLLANIALH GMEKCLEDYA ETLSGSKRDN KHALSLIRYA DDFVILHKDI KILLQAKTVI QEWLNQVGLE LKPEKTRIAH TLEEYEGNKP GFDFLGFTIR QWKFKTTKQG FKTLIKPSSK SIKTHYRRLA DICDNHKTAP KKALIAKLNP IIRGWANYFS TVVSKETFSK LDHLVWKRIG RWASRRHPNK SAKWVKSKYF PRCKVTRNWL LNDDEYILNR HSDVAIVRHV KVKGNKSPLD GDWTYWSSRI GKHPGIRKEV TTLLKRQKNK CAFCGLTFRS NDLMEIDHIK PISEGGDNTV KNKQLLHLHC HDTKTALDNI TYTRPKLQDL PDKYLWVNDM LILKQGCTYE KGRLGEEPDE VKVSRPVLKT SRVR
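
The relationship between the gene and protein sequences above can be spot-checked by translating a few codons. This is a minimal sketch, not the database's own method. It assumes the gene sequence is listed 5' to 3' in coding orientation (it begins with ATG even though the gene is on the minus strand), and it relies on translation table 11 using the same codon-to-amino-acid assignments as the standard code. The helper `translate` and the two excerpts are illustrative; the excerpts are the first 30 and last 15 bases of the gene sequence.

```python
from itertools import product

# Codon assignments in NCBI "TCAG" order; table 11 shares these with the
# standard code (the tables differ in their permitted start codons).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate in frame 0, ignoring whitespace and stopping at a stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_30 = "ATGAACAAAG CTAAAACCCT AAAAAGAAAT"  # start of the gene sequence
last_15 = "AGTCGGGTAA GGTAA"                   # end of the gene sequence

print(translate(first_30))  # MNKAKTLKRN
print(translate(last_15))   # SRVR
```

Both outputs match the first ten and last four residues of the listed protein sequence, which is consistent with the excerpts being in frame.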