Gene Information |
Locus tag | Tery_4275 |
Symbol | |
ID | 4245927 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | - |
Start bp | 6591534 |
End bp | 6594539 |
Gene Length | 3006 bp |
Protein Length | 1001 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 638109167 |
Product | hypothetical protein |
Protein accession | YP_723745 |
Protein GI | 113477684 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1743] Adenine-specific DNA methylase containing a Zn-ribbon |
TIGRFAM ID | |
| |
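The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive, and that the CDS is unspliced with a single terminal stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 6591534
END_BP = 6594539
GENE_LENGTH_BP = 3006
PROTEIN_LENGTH_AA = 1001


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one stop codon.

    Assumes an unspliced CDS whose length is a multiple of three
    and which ends in exactly one stop codon.
    """
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, both checks agree with the listed gene length and protein length.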
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.196704 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAAATT ATCCCAAACG CTTAATTGAA GTAGACCTCC CAATTAAACA AATCTCTGCC CATGCTAGAC GAGAAAAATC AATTCGTCAC GGTCATATTT CGACCTTACA TATTTGGTGG GCAAGACGAC CTTTGGCTGC TTGTCGGGCT GTAATTTGTG CTGCTCTTTG GTTAGACCCA GTTGATGATA ATTGCCCGGA ATTGTTTCGG GTTGAGGCTG CAAAAGTTAT TAATGGTTTT GCCAAAATAG TTGATAAAAA TTTAGATAAT CATGGCTCCA CAGCAAACCT AAAAAAATGG CAAGTTTTAG CTAAACCTAA AAATCAGTTA AATCCAGAAA AAATAGAACA TTTAAACATA CTCCGTTATC GCTTATTAGA CTTTATTGCT GACTTTGCTA ATTGGGATAA TTCTACACAT CCAGATTATT TACAAACCAG TCGTAAACTA ACAGAAATTA GTCATTATTC TTTAGGAGGA ATTACTGATA CAAAACCCGT AGTTTTTGAC TCTTTTGCGG GTGGAGGTGC GATTCCTTTG GAGGCTTTGC GGGTAGGAGC AGATGCTTTT GCCTCTGATT TGAATCCGGT GGCGGTTTTG TTAAATAAGG TTGTGTTGGA ATATATTCCT AAGTATGGGC AAACTTTGGC GGATGAGGTG CGTAAGTGGG GAGAATGGAT AAAGCAGGAG GCAGAAAAGG AATTGGGGGA GTTTTATCCT AAATCTGCTG CAACACTTAA GAATGGGGAT GTGGAAACAC CTATTGCTTA TCTTTGGGCG CGGACGATTG TTTGTGAGGG GCCTGGATGT GGGGCGGAGG TTCCTTTGAT GCGGAGTTTG TGGTTGGCAA AGAAAAAGAA TCGGTCGGTG GCGTTACGGT TGATTCCTCG GCAGGAGGAA AAGCGAGTTG ATTTTGAGAT TGTGGAGCAG GTGAAAGGGA AAGATGTTGG GGATGGAACG GTGAAACGAG GTTCAGCGAC TTGTCCTTGT TGTGGGTTTA CGACTCCGGT GGCTTCGGTG CGAAAGCAGT TGCAAGCAAG GGGTGGGGGT GCGGATGATG CCCGGTTGTT TTGTGTGGTG ACGACTAGGG AGAAGGTGAA GGGTAGGTTT TATCGGTTGC CCAATGAGAG AGATTTGGAA GCGGTGAGAA TGGCAGGTGA GGAGTTGGCA AGGCGGAAGT TGGAGTATGG TGGGGAGTTG AGTTTGGTGC CGGATGAACC GGTGCCAGTT ATGAGCGGTG TTTTTAATGC GCCCATTTAC GGACATAACA CCTGGGGAAG TTTATTTTCT TCTCGTCAAG CTTTAGCTTT AACTACTTTG GTGAGGTTAG TTAAGGAAGT GGGAAAAAAA TTGGCTAGTA ATGAGGATGA AAGGTTGGCG ATCGCTATTC AGACTTGTTT GGCTTTGGCA GTTGATCGCT GCACTAATCA GTTTTGCTCT CTATCAAAGT GGAATAACAG TCGAGAGTTA ATAGACGGTG TTTTCGCTAG GCAAGCTTTA CCCATGCTTT GGGATTTTGG AGAAACAAAT CTAGTGGGAG GTTCTGACGG ATACTGGCAA GGTGCTGTCA GTTGGGTTTT ATCCGTCATC AAAGCATCAT TATTACCCGA CCCCGGTCAA ACCCAACAAG CAAATGCAGC GAGTCACCCC CTACCAGACG ACTTTACTCA ATGCTTCTTT TCCGATCCTC CTTATTACAA CGCTGTACCC TACGCTGACT TATCCGACTT TTTCTATGTC TGGCTAAAAA GAACTCTAAA CAAAACCTAC CCCAACTTAT TTTCTAGTGA AACTACTCCG 
AAAGATGATG AAATTTGTGA AATGGCAGGA TGGGATTCCA AAAGATACCC ACACAAAAAC GGCAAATGGT TTGAAACCCA GATGGCAAAA GCAATGGCAG AAGGATGTCG CATCATATCC TCAGACGGAA TAGGTATTAT TGTCTTTGCC CACAAATCAA CTGCTGGATG GGAAGCTCAA CTACAAGCAA TGATAGATGC TGGTTGGAAA ATAACCGCAT CCTGGGCAAT AGACACAGAA AGGGGCAACA GACCCCGCGC CCAAAACTCT GCTGCCCTCG CCTCATCCAT CCATCTTGTT TGCCGACCCC GCAACAATAA CAATATCGGC GACTGGCGCG ACATCATCCA AGAGCTACCC CAACGCATCC AAGACTGGAT GCCACGTCTG ACATCAGAAG GTATAGTCGG TGCTGATGCT ATCTTTGCTT GTATCGGACC CGCCCTAGAA ATATTCTCTC GTTACTCCAG CGTGGAAAAA GTCAATGGGC AACCAGTCAC TTTGGGCGAA TACCTAGAAT ATCTCTGGGG AACAGTAGCA AACGAAGCCT TAAAACAGAT ATTCTCGGAA GCTGACACTA ATAACCTGGA AGCAGACGCT CGTCTCACAG TCATTTGGTT ATGGACGCTT TCTACTGGCA GCACCGAATC AGCAACAGAA ACAGAGTCAG ACCAAGATGA GGAAGACACC CCCGCCAGCA AAAATCAAAA ATCTACAGGT TTTGCCTTAG AGTTTGACAC CGCCCGCAAA ATCGCCCAAG GTTTAGGAGC GCACCTAGAA AACCTCACCC ATTTAGTAGA AATAAAAGGT AATAAAGCCA GACTTTTACC AGTACAAGAA AGGAGTCATT ATTTAATCGG AAAAGTTGCC CCTACTACTA ATACTAAAAA ACGCAAAAGT AAGACTCCAA AACAACTAAG TTTAGAAGGT ATTTTGCCAG AAAAAGAAAC TGAAATAGAA GTAGAAGCAG ATATAGCGAT CGCTGAGATA GGCAAAACCG TATTAGACCG GATACATCAA ACCATGTTGT TACACAAAAC AGGACGTAGC GAAGCAGTGA AACGACTATT AGTAGAGGAG GGAGTAGGCA AAGACCCCAG ATTTTGGAAT TTAGCAGATG CCCTATTGCG GGTATATCCT CAGAATGTGG AAGAGAAACG TTGGTTAGAG GGAATATTGG CAAGGAAAAA AGGATTGGGT TTTTAG
|
Protein sequence | MTNYPKRLIE VDLPIKQISA HARREKSIRH GHISTLHIWW ARRPLAACRA VICAALWLDP VDDNCPELFR VEAAKVINGF AKIVDKNLDN HGSTANLKKW QVLAKPKNQL NPEKIEHLNI LRYRLLDFIA DFANWDNSTH PDYLQTSRKL TEISHYSLGG ITDTKPVVFD SFAGGGAIPL EALRVGADAF ASDLNPVAVL LNKVVLEYIP KYGQTLADEV RKWGEWIKQE AEKELGEFYP KSAATLKNGD VETPIAYLWA RTIVCEGPGC GAEVPLMRSL WLAKKKNRSV ALRLIPRQEE KRVDFEIVEQ VKGKDVGDGT VKRGSATCPC CGFTTPVASV RKQLQARGGG ADDARLFCVV TTREKVKGRF YRLPNERDLE AVRMAGEELA RRKLEYGGEL SLVPDEPVPV MSGVFNAPIY GHNTWGSLFS SRQALALTTL VRLVKEVGKK LASNEDERLA IAIQTCLALA VDRCTNQFCS LSKWNNSREL IDGVFARQAL PMLWDFGETN LVGGSDGYWQ GAVSWVLSVI KASLLPDPGQ TQQANAASHP LPDDFTQCFF SDPPYYNAVP YADLSDFFYV WLKRTLNKTY PNLFSSETTP KDDEICEMAG WDSKRYPHKN GKWFETQMAK AMAEGCRIIS SDGIGIIVFA HKSTAGWEAQ LQAMIDAGWK ITASWAIDTE RGNRPRAQNS AALASSIHLV CRPRNNNNIG DWRDIIQELP QRIQDWMPRL TSEGIVGADA IFACIGPALE IFSRYSSVEK VNGQPVTLGE YLEYLWGTVA NEALKQIFSE ADTNNLEADA RLTVIWLWTL STGSTESATE TESDQDEEDT PASKNQKSTG FALEFDTARK IAQGLGAHLE NLTHLVEIKG NKARLLPVQE RSHYLIGKVA PTTNTKKRKS KTPKQLSLEG ILPEKETEIE VEADIAIAEI GKTVLDRIHQ TMLLHKTGRS EAVKRLLVEE GVGKDPRFWN LADALLRVYP QNVEEKRWLE GILARKKGLG F
|
| |
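The sequence fields above are printed in space-separated blocks of ten residues, so they need cleaning before any computation such as the GC content listed in the gene table. The sketch below is illustrative only: the helper names are hypothetical, and the example runs on the first two blocks of the gene sequence rather than the full record.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 to 1.0)."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


if __name__ == "__main__":
    # First two ten-base blocks of the gene sequence shown above.
    prefix = "ATGACAAATT ATCCCAAACG"
    print(clean_sequence(prefix))
    print(gc_fraction(prefix))
```

To compare against the GC content field, the complete gene sequence would be passed to `gc_fraction` in place of the short prefix.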