Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_3305 |
Symbol | |
ID | 4243611 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 5068307 |
End bp | 5070214 |
Gene Length | 1908 bp |
Protein Length | 635 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 638108293 |
Product | RNA-directed DNA polymerase |
Protein accession | YP_722884 |
Protein GI | 113476823 |
COG category | [L] Replication, recombination and repair [V] Defense mechanisms |
COG ID | [COG1403] Restriction endonuclease [COG3344] Retron-type reverse transcriptase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 0.00301539 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGAATAAAG CTGAAATACT AAAAAGAAAA TTGGACAATC CAAGGCCCTT TTCAAGTGTG GCATGGGACA CGTACGATAT ACCTAACCAG GTTTGTGTTA ATCCCAATCT CAAATGGAAA GACATCAACT GGAAAAAGGT AGAAAAGTAT GTGTTTAAGT TACAAAAGTT AATCTATAGA GCATCCAGCC GTGGCGAAAT CCGCAAAATG CGTAAATACC AAAAACTTCT GACCAAAAGT TATTATGCAA GGTTGCTAGC TGTTAGGCGT GTGACTCAGG ACAACCAGGG AAAGAAAACT GCTGGTATAG ATGGTATAAA AAGCCTTCCC CCAATGCAGA GGTTGAACCT GGTAGAAATG TTAGGGTCAC GATTTCTTAA AGCAAGCCCA ACCCGTAGAG TCTGGATACC AAAACCAGGT AGAGAAGAAA AACGTCCACT AGGCATACCC ACTATGTATG ATAGGGCACT TCAAGCACTG GTAAAGTTAG GCATGGAACC AGAATGGGAA GCACTTTTTG AACCTAATAG TTATGGTTTT AGACCAGGAC GGTCAACATA CGATGCTATT GCAGCAATCT ATGTCAGTAT TAACCACAAA CCAAAATATG TTTTAGATGC TGACATATCC AAATGTTTTG ACCGAATTAA CCATGATGCA CTGTTGGGAA AAATAGGAAA ATCCCCATAT AGAAAATTAG TTAAACAATG GCTAAAATCC GGGGTATTTG ACAATAAACA ATTCTCAAAC ACTGTGGAAG GTACACCACA GGGAGGGGTA ATATCACCCT TGCTAGCAAA CATCGCCCTA CACGGTATGG AAAAATGCCT AGAAGATTAT GCAGAAACCC TCCCAGGGAC AAAGCGTGAT AATCAAAGAG CATTATCCTT AATACGATAT GCCGATGACT TTGTAATCCT ACATAAAGAC ATCAAAGTAT TGTTACAAGC AAAAACTGTA ATACAGGAAT GGTTAAACCA AGTAGGGTTA GAACTAAAAC CAGAAAAAAC CAAAATTGCC CACACTCTGG AAGAATATGA AGGAAATAAA CCCGGATTTG ACTTTCTAGG ATTTACAATA AGGCAATGGA AAGGTAAGAC AACCAAACAA GGATTCAAAA CACTGATTAA GCCATCATCT AAGAGTATTA AAACTCATTA TCGGAAGCTG GCGGATATAG GTGACACCTA CAAAACCGTC CCTACAAAAG CTCTAATAGC TAAACTTAAT CCGGTAATTA GAGGATGGGC CAACTACTTT TCCACCGTAG TCAGTAAAGA GGTATATAAT AAATTAGACT ACCTTCTATG GGAAAGATTA TGGAGATGGG CAAGTAGACG GCATCCAAAC AAGTCAGCCA AATGGGTCAA GAATAAGTAT TTTCCTCGCT GCAAAGTCAC CAGAAACTGG TTACTTAACG ACGGCGAATA TATACTTAAC CAACACTCAG ACGTTGCCAT AAAAAGGCAC GTCAAGGTAA AAGGCAATAA ATCCCCTTAT GACGGTGATT GGACTTATTG GAGTAGTAGA ATCGGCAAAC ACCCAGGTGT AAGGAAAGAA GTCACAACGC TGTTAAAACG GCAAAAGAAT AAATGCGCAT TTTGTGGACT AACCTTTAGA TCAAATGACC TCATGGAAAT AGACCATATA AAACCAAAGT CTGAAGGCGG TGATAACTCA ATTAAAAACA AGCAACTGTT ACACCGACAT TGCCACGATA CTAAAACTGC TTTAGATAAT AAAACATACA CAAAACCTAA GTTACAGGAT TTACCTGATG AATATCTATG GGTAAATGAT ATGTTAATTC TAAAACAGGG ATGTACCTAT GAAAAAGGAC GTTTAGGAGA GAAGCCGGAT GAGGTGAAAG TCTCACGTCC GGTTTTGAAG ACGAGTCGGG TAAGGTAA
|
Protein sequence | MNKAEILKRK LDNPRPFSSV AWDTYDIPNQ VCVNPNLKWK DINWKKVEKY VFKLQKLIYR ASSRGEIRKM RKYQKLLTKS YYARLLAVRR VTQDNQGKKT AGIDGIKSLP PMQRLNLVEM LGSRFLKASP TRRVWIPKPG REEKRPLGIP TMYDRALQAL VKLGMEPEWE ALFEPNSYGF RPGRSTYDAI AAIYVSINHK PKYVLDADIS KCFDRINHDA LLGKIGKSPY RKLVKQWLKS GVFDNKQFSN TVEGTPQGGV ISPLLANIAL HGMEKCLEDY AETLPGTKRD NQRALSLIRY ADDFVILHKD IKVLLQAKTV IQEWLNQVGL ELKPEKTKIA HTLEEYEGNK PGFDFLGFTI RQWKGKTTKQ GFKTLIKPSS KSIKTHYRKL ADIGDTYKTV PTKALIAKLN PVIRGWANYF STVVSKEVYN KLDYLLWERL WRWASRRHPN KSAKWVKNKY FPRCKVTRNW LLNDGEYILN QHSDVAIKRH VKVKGNKSPY DGDWTYWSSR IGKHPGVRKE VTTLLKRQKN KCAFCGLTFR SNDLMEIDHI KPKSEGGDNS IKNKQLLHRH CHDTKTALDN KTYTKPKLQD LPDEYLWVND MLILKQGCTY EKGRLGEKPD EVKVSRPVLK TSRVR
|
| |