Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_3770 |
Symbol | |
ID | 4243718 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 5792469 |
End bp | 5793836 |
Gene Length | 1368 bp |
Protein Length | 455 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 638108707 |
Product | hypothetical protein |
Protein accession | YP_723291 |
Protein GI | 113477230 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.00806133 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGGAGGTAA AAAAACTCTT GAGTAAATTA GACTTAATTA GATTTTATAA AGCTTGTAAT CCAAGCAAGA CCTTGATAGT AGGAAATGCA GAAGATCAGC AATATTATAT AGATTTTGCC AGTGTTAGAG GTAGCGATAT AATTAAAGAA TTAGAAAGAA CTATAATACT ATTATCAGGA AATCAGCCGA CTTGTCAACT ATTTACAGGC CACATAGGTT GTGGTAAGTC TACCGAATTA TTTCGACTTA AAGATAAATT AGAAAAACAA GGATATCATG TTGTTTACTT TGAGTCTTCA GAAGACTTAG ATATGGGAGA CGTAGATATT AGCGATATTT TGCTAGCGAT TGCTCGTCAA GTCAGTGAAA GCATGGAACA AGCTAAAATT CAAATCAAAC CAAGTTACTT CCAAAAGTTA TTTGGTGAAA TATCAGAACT GTTACAAACT CCCATAGAAC TTTCCGCAGA AGCAGAGCTA TCTGTTGGTA TTGCTAAAAT TACTGCTAAA ACCAAAGATT CCCCTAAACT ACGTTCTCAG TTAAGACAAT ACCTCGAACC CCGAACTAGC ACTATTCTAG AATCAATCAA TCAAGAATTA TTAGAACGTA CCAATATAGA GCTCAAGAGA CGAGATAAAA AAGGACTTGT TGTGATTGTT GATAATCTAG ACCGTGTAGA TAACTCACCC AAATCTTGGG GACGTACTCA ACCAGAATAT TTATTTGTAG ACCGAGGCGA GCAACTCAAA AAACTCAACT GTCATGTCGT CTACACAATT CCTCTTACAT TAATGTTCTC TAATGATTAT GGCAGATTAT CTAGTCGATT TGGAGTGAAA CCAAAAATAT TACCAATGGT GCCTGTACAG ATTAGGTCTA AAAAACCGAC AATAGCCCCA AATGACTATG AGACAGGAAT AAAATTATTA CGAGAAATGG TATTAGCTAG AGCTTTTCCA GAAATACCAT CCCAAAAACG TCTAGAATTT ATTCCCGCTC TATTTGAAAC TTCAGAAACC CTTGATAGAC TTTGTCGAGT TAGTGGGGGT CACATGCGAA AACTGTTAAT GTTGCTCTAT AGTTGTCTGC AACAAGAAGA TCCTCCTTTT TCTAGTGAAT GTTTAGAAAA TGTGATTCAG GAATATCGAG ATGATCTAAC TAGAGCTATT ACTGTTGACG AATGGGAGTT GCTATTTAAA GTAGTACAAA ATCAAAGAGT AACAGGAGAA GAAGAATGCC AAGCTTTACT CAGAAGTATG TTTGTTTTTG AATATCGAGA TCGGGATGGT CATTGGTTTG GAATTAATCC ATTGTTGGCA GAAACTAAAA AATATGCTAG TTGGGTTAAT AGTAATTATC AGGAGTGA
|
Protein sequence | MEVKKLLSKL DLIRFYKACN PSKTLIVGNA EDQQYYIDFA SVRGSDIIKE LERTIILLSG NQPTCQLFTG HIGCGKSTEL FRLKDKLEKQ GYHVVYFESS EDLDMGDVDI SDILLAIARQ VSESMEQAKI QIKPSYFQKL FGEISELLQT PIELSAEAEL SVGIAKITAK TKDSPKLRSQ LRQYLEPRTS TILESINQEL LERTNIELKR RDKKGLVVIV DNLDRVDNSP KSWGRTQPEY LFVDRGEQLK KLNCHVVYTI PLTLMFSNDY GRLSSRFGVK PKILPMVPVQ IRSKKPTIAP NDYETGIKLL REMVLARAFP EIPSQKRLEF IPALFETSET LDRLCRVSGG HMRKLLMLLY SCLQQEDPPF SSECLENVIQ EYRDDLTRAI TVDEWELLFK VVQNQRVTGE EECQALLRSM FVFEYRDRDG HWFGINPLLA ETKKYASWVN SNYQE
|
| |