Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_0271 |
Symbol | |
ID | 4241638 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 417849 |
End bp | 420950 |
Gene Length | 3102 bp |
Protein Length | 1033 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 638105612 |
Product | pentapeptide repeat-containing protein |
Protein accession | YP_720227 |
Protein GI | 113474166 |
COG category | [S] Function unknown |
COG ID | [COG1357] Uncharacterized low-complexity proteins |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.977531 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0705059 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTGTTA AACTGTTGGA CGATTTTTCG AAATCTCTCG GGTTGGAAAC TGTTAACGGT GGGGCTAACC TTGTTAAGGC GCTTTTTAAA TTAGCTGAGA CACTAGAAAA ATATAAGGAT ACTCAGGAAT TAAAGTCTGC TATTGAAAAG ATTAATATTG AATCTCTGTT GGATGCACTA AATTCTCCAT TGGGAAGTGT TGTTAGAGAT AGTGTACCTT TCTTGCCTAT TGCAACTGGA ATTATTTCTT ATATTGTTAA ACAAAGTCGT CAGGAGCCTA CTTTGGAAGA TGAGGTGCAG TTGTTGGTTC AGGTGGCTTT TTTAGAAAGT TTTCGACAAT TTTTTGTAGA GCATCCAGAA ATTAAGGAGC AGTTCACAGA CACTGAGGCT TCGGAGGATG TAAAAAAACA GATTAAGAAG TTGGGTAAAG ATGGTTTTAA TCGCCAAGAT GCTAAGGATA CTTTGATTTG TTTTTATGAT TCACCGTTGA GAGAAAAGTT TGATAATGTT GTATTGGCAC GCTTAAAAGA GTCTGGTTTG GATGAAAAAA CGGCGAAAAT AGTGACTGAA AGAATTTCTC GCAGTACCCA TCGTTATATG AAAGAAGCGG TATCAGAGGT GAAGGATGAT GCTAAAAAAT TAGCGGGAAT TTATGGGTAT GGCTGGCAGC AAGATTTGGA GGTTTATGGG AGTGTTGATG GGTATTTGAA AAATAAAGTT GCTCGTCTGC CAGAGGAAAA AGTATTTGAT GAAAGTTTCT CTTTTCAGGA TATTTATGTA CCACTTGAGG TCGAGTCTGT GTCCGATGGA GAGGTTGATA AAAATGCTGA ATCTCAGAAT ATTGAGAAGT GGGCAAAAAC AATTTTGTTA CCTCAACACT TTGATGAAAA TTTTGATGAA AATTCGCGGA AAAATAATGA GGAAAAAGAT AAAAAAGATA AACAGGTTTT ATTTATTCAA GGGGGGCCTG GAAGAGGAAA AAGTGTATTT TGCCGGATGT TTGCTGACTG GGTGCGGCGG GAACTACATC CTATCTATAC CCCTATTTTG ATCCGGTTGC GAGATGTGAA AAATTTTGCA GCAAATATTG ATCAGACTCT GGCTGATGCT GTGAATTACG ATTTTGTGAA AATTGATGGG GGCTGGTTGA CAGACCGGAA TACTCGGTTT CTGTTTTTGC TGGATGGGTT TGATGAGTTG TTGTTGGAGC GGGGGGCAAG CAGTGAGTTG AAGGTGTTTT TAGATCAGGT AGCACAGTTT CAGAAACAGG CAGCGGAAAA TAAGGAGCGC GGTCATCGGG TTTTGATCAC GGGTAGACCT TTGGCGCTTT ATGGTATTGA AAGGTTGATG CCTCCTAATT TGGAGCGGGT GAGTATTTTG CCGATGGATG ATGATATCCA GGGACAGTGG TTTGAGAAGT GGCGGACGGT GGTGGGGGAT GAGGAAACAG AAAAGTTTGG GGAGTTTTTA CGGAGTGAAC AATGTCCGGA GCAGGTTAAC GAGTTGGCGC GGGAACCTCT TTTGTTGTAT TTGCTGGCTT CAATGCACAG GGGTGGGAAG TTAAAGGTAG AGATGTTTGC TAATACTGAT GTTGGTGAGA CAAAGGTTTT AATTTATGAG CAGGCGTTGG AGTGGGTACT GGAAAAACAG CGGGTAGAAG AGGGTAAAAA TATTAATCTT GAAATTACGA AGTTGGAGCC AGAAGATCTG GAGGTTTTAT TGACAGAGGC GGGGTTGTGT GTGGTGCAGT CTGGTGGGGA ATATGCTGCT ATAGAAATGA TCGAGGAGCG GCTGGTGAAA AAGGGAGATA AGGAGTTAAA GCGGTTAATT GAAAATGCTA GGGAAGATAA GCGGGAGGAT GGGTTGAAAA ATGCTTTGGC TGCTTTTTAT TTGAAGTCGG CAACGGGGGC AGAAAATTCG GTGGAGTTTT TTCATAAGAG TTTTGGTGAG TTTCTCTGTG CAAAACGGAT GGCAGAAAGT CTGGAATATT TTACGCAGAA AACTAAGAAA GGACGCAAGT TTAGTTACTC TGTTTCTGAT GAGGAGTTGG TATGGCAGGT TTACGATCTG TTTGGTTATG GGGGTTTAAC CGTGGAAGTT GTGGAGTATT TGATGGCTTT GTTGGTGAAA AGTGAAGCAA AGCTGGTGGT TTTGTTTGAG CGCTTACATG AGTTTTATCT AGACTGGTGT GATGGGAAGT TTATTGAGGC GACGGAGGAA ACTTTACCTC AGAAAAAGGC GAGGGAGTTG CAGCAGTGGC GTATTAAGTC TGGGCAACGC CAAGTGGATA TTTATACTGG GTTGAATGTG ATTATTTTGT TGTTTGAGCT GGATCGTTAT GGGCAGTCTC AGGAGGAGTT GCGGGAGCAG TTGCGTTTTT ATCCTTGCGG TCAACATGGT AGTGAAAATT TTGAGGCTCT AAAACTGTTA ACAATTATTG GCTATAGTCA ATGTTTAGGA GTTGGTGCGT TTTGGGAAAC AGTAGGTCAG TTTCTAAGTG GTGCCGACCT CAGATATGCC GACCTCAGTG GTGCCTACCT CATAGTTGCC AACCTCAGAT ATGCCGACCT CAGTGGTGCC TACCTCATAA GTGCCGACCT CAGTGGTGCC TACCTCATAG GTGCCAACCT CATAGGTGCC GACCTCAGTC GTGCCGACCT CAGATATGCC GACCTCAGTG GTGCTAACCT CAGTGATGCC AAACTCAGTG GTGCTAACCT CAGTGATGCC AAACTCAGTG GTGCCGGCCT CAGTGGTGCC GACCTCAGAT ATGCCGACCT CAGTGGTGCT GACCTCAGTC GTGCCAAACT CAGTGATGCC GGCCTCAGTG GTGCCAACCT CAGTGTTGCC GGCCTCAGTG GTGCCGACCT CAGATATGCC GACCTCAGTG GTGCCGACCT CAGATATGCC GACCTCAGTG GTGCTGACCT CAGTGATGCC AACCTTAGTA ATGTCAGATG GAATAGTCAA ACCAAGTGGT CAAATACTAT TGGTTTACAT GAAGCAATAG AAGTTCCAGA AGACTTACAA CAAGATCCAG AATTTGCCGC AGCAGTTGCT GAGTCGGAAG CAGCAAGTCA AGAACAACAG TACATTTTTT GA
|
Protein sequence | MAVKLLDDFS KSLGLETVNG GANLVKALFK LAETLEKYKD TQELKSAIEK INIESLLDAL NSPLGSVVRD SVPFLPIATG IISYIVKQSR QEPTLEDEVQ LLVQVAFLES FRQFFVEHPE IKEQFTDTEA SEDVKKQIKK LGKDGFNRQD AKDTLICFYD SPLREKFDNV VLARLKESGL DEKTAKIVTE RISRSTHRYM KEAVSEVKDD AKKLAGIYGY GWQQDLEVYG SVDGYLKNKV ARLPEEKVFD ESFSFQDIYV PLEVESVSDG EVDKNAESQN IEKWAKTILL PQHFDENFDE NSRKNNEEKD KKDKQVLFIQ GGPGRGKSVF CRMFADWVRR ELHPIYTPIL IRLRDVKNFA ANIDQTLADA VNYDFVKIDG GWLTDRNTRF LFLLDGFDEL LLERGASSEL KVFLDQVAQF QKQAAENKER GHRVLITGRP LALYGIERLM PPNLERVSIL PMDDDIQGQW FEKWRTVVGD EETEKFGEFL RSEQCPEQVN ELAREPLLLY LLASMHRGGK LKVEMFANTD VGETKVLIYE QALEWVLEKQ RVEEGKNINL EITKLEPEDL EVLLTEAGLC VVQSGGEYAA IEMIEERLVK KGDKELKRLI ENAREDKRED GLKNALAAFY LKSATGAENS VEFFHKSFGE FLCAKRMAES LEYFTQKTKK GRKFSYSVSD EELVWQVYDL FGYGGLTVEV VEYLMALLVK SEAKLVVLFE RLHEFYLDWC DGKFIEATEE TLPQKKAREL QQWRIKSGQR QVDIYTGLNV IILLFELDRY GQSQEELREQ LRFYPCGQHG SENFEALKLL TIIGYSQCLG VGAFWETVGQ FLSGADLRYA DLSGAYLIVA NLRYADLSGA YLISADLSGA YLIGANLIGA DLSRADLRYA DLSGANLSDA KLSGANLSDA KLSGAGLSGA DLRYADLSGA DLSRAKLSDA GLSGANLSVA GLSGADLRYA DLSGADLRYA DLSGADLSDA NLSNVRWNSQ TKWSNTIGLH EAIEVPEDLQ QDPEFAAAVA ESEAASQEQQ YIF
|
| |