Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_4800 |
Symbol | |
ID | 4246454 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | - |
Start bp | 7378708 |
End bp | 7380708 |
Gene Length | 2001 bp |
Protein Length | 666 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 638109648 |
Product | group II intron, maturase-specific |
Protein accession | YP_724224 |
Protein GI | 113478163 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3344] Retron-type reverse transcriptase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.256541 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATAAAG CTAAAACACT AAAAAGAAAC TTAGGAAACC CTCAACCATT TTTAAGTGTG GCATGGGATA CAGAGGATAT ACCGTCTCAA GTTAGTATCA ATCTAACTCT TAAATGGAGA GACATAGACT GGAGGCGGGC TATGAAGAAT GTATTTAAGT TGCAAAAGTT AATTTACAGA GCATCCAGCT GTGGCAAAAT TCGCAAGATG CCTAGATACC AAAAACTTCT GACCAAAAGT TATTACCCTT TGTTGCTAAC TGTTAGATGG GTGAGACAGG ATAATCAAGT TAAAAACACC GTCGGGGTAG ATGGTATTAA AAATCTACCA CCAATGCAAC GTTTTAATTT TGTGTACCTA CTCAAATCAC ATTCCCTGAA AGCCATTCTA AACCCTAGAG TCTCAATACT AAAGCAAGGG TTAAATGCAA AATTTTCCTT AAGCATCCCA ACAATGTACG ACCGAGCATT ACAGGGCTCG GTCAAGCTAA GTATGGAACC AGAGTGGGAG AAACGCTTTG AACCTAATAG TTATGGTTTG GGACTAGGAC GATCAACCAA TGATGCTATT GAAATCATCT TTACCAGCAT CAAGAATAAA TCTAAATACT TTCTTGATGT TGATATATCC AAATGTTTTG ATCTATTTGA TCTAATTAAC CATGATGCAC TGCTAGGAAA AATTAAGAAA TCATCCTACA GACGACTAAT CAAACAATGT TTAAAGTCTA GAGTTTTAGA CAATAATCAA TTCTACCCAC CCCAACAGCT CCTATTCACC TACCCCGACA GCTTCCTTCT CGGAAATAGA TGTGGGTTGG TTGAGTTGGA TACAGCACAG GGAGAGGTAA TGAGTCCTCT GCTAGCAAAC ATCACCTTAC ACGGTATAGA GAAAAGACTA ATGGAGTTTG CCAAAACCCT AGATTTGAAA AATAAAAAGG GTGCTCAAAT GAGCTGGCAA TCGAAATGTC AAAGTCTAAT TCTAGTAGAC TATGTAGATG AGTTCGTAAT TCTACAGGAA GATATCAAAG TACTACTGCA AGCAAAAACC GTAATGCAGG AATGGTTAAA CCAGGTAAGA TTAGAATTAA AACAAGAATT GACCAGAATT GCCAACGCTC TAGAAGAGTA TAAAGGGAAC AAGCCTGGAT TGGACTTCTT CGGATTCACA ATGAGGCAAG GTCAGGCAAA GATAGCAAAA CTTGGATTTC AAACCCTGAT TAAATTGTCC GCTAGAAGTA TTAAAACTCA TTACCGGAAA CTGGCAGAAA GATTTGATTC ATACAAAATT GCACCAGTCA AAGGATTAAT TGCTAAACTT AATCCAGCTA TCAGTGGATG GGTTAACTAT TTCTCAACAC AAGTAAGTAC TAATAAAATA TTCAACAAAC TGGATATGCT TCTGTGGAAG AGACTATGGT GTTGCGCAAG TAGACAGCAT CCAAACAATT CAGCCACCTG GGTTAAACAA GAGTATTTCC CGAATATTGA GAATGGAAAT TGGATTCTCC CGCGGGGCGA ATATATGCTA AATCAATACT CTGATGTTCC CATCATAGGA GACATCAAAG TAAAAGATAA TAACTTAACA CTAGATGGTG ACTGGAATTA TTGGACTAGC AGAGTTGGCA AATATTCAGG GGTAAAAACA GAGACTTCAA AATTATTCAA AAGTCAGAAG AATAAATGTG CATTTTGTGG ATTGACCTTT AGAGTAACTG ACCTAATAGA GGTAAACTAT GTAATACCTA AGTTTAAAGG TGGTGACAAC ATACTAAGGA ATAAACAATT GTTGCACCAA TATTGCCACG AGACTAACAT TGCTTTAAAT CACAAGAGCT ATCTAATAGG CAATTCACAG GACTTACCTG AATGTTACTT ATGGGTTAAC GATATGCTGA CACTAAAGCA GGGATGTACC CTTGAATTGG GACCTCTAAC AGAGGAGCCG GATGAAGCGA AAGTTTCATG TCCGGTTCTG AAGACAAGTC GGGTAGGGTG A
|
Protein sequence | MNKAKTLKRN LGNPQPFLSV AWDTEDIPSQ VSINLTLKWR DIDWRRAMKN VFKLQKLIYR ASSCGKIRKM PRYQKLLTKS YYPLLLTVRW VRQDNQVKNT VGVDGIKNLP PMQRFNFVYL LKSHSLKAIL NPRVSILKQG LNAKFSLSIP TMYDRALQGS VKLSMEPEWE KRFEPNSYGL GLGRSTNDAI EIIFTSIKNK SKYFLDVDIS KCFDLFDLIN HDALLGKIKK SSYRRLIKQC LKSRVLDNNQ FYPPQQLLFT YPDSFLLGNR CGLVELDTAQ GEVMSPLLAN ITLHGIEKRL MEFAKTLDLK NKKGAQMSWQ SKCQSLILVD YVDEFVILQE DIKVLLQAKT VMQEWLNQVR LELKQELTRI ANALEEYKGN KPGLDFFGFT MRQGQAKIAK LGFQTLIKLS ARSIKTHYRK LAERFDSYKI APVKGLIAKL NPAISGWVNY FSTQVSTNKI FNKLDMLLWK RLWCCASRQH PNNSATWVKQ EYFPNIENGN WILPRGEYML NQYSDVPIIG DIKVKDNNLT LDGDWNYWTS RVGKYSGVKT ETSKLFKSQK NKCAFCGLTF RVTDLIEVNY VIPKFKGGDN ILRNKQLLHQ YCHETNIALN HKSYLIGNSQ DLPECYLWVN DMLTLKQGCT LELGPLTEEP DEAKVSCPVL KTSRVG
|
| |