Gene Information |
Locus tag | Tery_4420 |
Symbol | |
ID | 4246073 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 6811774 |
End bp | 6814512 |
Gene Length | 2739 bp |
Protein Length | 912 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 638109304 |
Product | hypothetical protein |
Protein accession | YP_723881 |
Protein GI | 113477820 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
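The coordinates and lengths in the table above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS length includes the terminal stop codon; both assumptions agree with the values in this record but are not stated by it. The function names are hypothetical and chosen only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids, assuming the CDS includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length(6811774, 6814512)
print(length)                  # 2739
print(protein_length(length))  # 912
```

Both results match the Gene Length (2739 bp) and Protein Length (912 aa) fields of the record.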
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0816667 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGTGAACG ATGGTAATGT TGATAGTAAC ACAGCCACAA CAACAATAAA TATTACTCCA GTCAATGACG CTCCCGTATT AGACTTAGAT GGGAATAATA GTAGTACCGC TACAGGCAGT GACTACATAA CAACTTTCAC AGAAGGCGGA GGAGCAGTAG CCATAGGAGA CAGCGATGTT AGTATCACAG ATGCAGATGA TATCAACATA GAGTCAGCCA CAATAACATT AAGCAGTCGA CCAGACGGAG ATACAGTAGA AAGCTTATTA GTCAACGGTA CACTGCCAAC AGGAATAACA GCGAGCAGTT ATGACAGTAG TACAGGAGTC ATAACACTGA CAGGTAGCGC TACATTAGCT GACTATCAGA CAGCCATAGC CCAAATACAA TATAACAACA CCTCGGAAAA CCCAGACACC AGTGCTCGGA GCGTGACCGT AGTGGTGAAC GATGGTAATT TTGATAGTAA CACAGCCACA ACAACAATAA ATATTACTCC AGTCAATGAC GCTCCCGTAT TAGACTTAGA TGGGGATAAC ACAGGGAATG ACTACACAAC AACTTTCACA GAAGGCCTAG GAGCAGTAGC TATAGGAAAC AACGTTGGTA TAGCAGATGA AGATGATACC AACATAGAGT CAGCCACAAT AACATTAGGC AGTAGACCAG ATGGAGATAC AGTAGAAAGC TTATTAGTCA ACGGTACACT ACCAACAGGA ATAACAGCGA GCAGTTATAA CAGTACTACA GGAGTCATAA CACTCACAGG CAGCGCTACA TTAGCTGAGT ATCAGACAGC CATAGCCCAA ATACAATATA ACAACACCTC GGACAACCCT AATACCACCG ATAGGACAGT GACCGTAGTG GTGAACGATG GTGATGCTGA TAGTAGTACA GCCACAACAA CAATAAATAT GACTCCAGTC AATGATGCTC CAGTATTAGA CTTAGATGGG GATAATAGTA CGACAACAGG CAGTGACTAC ATAACAACTT TCACAGAAGG AACGGCAGTA AACATAGGAG ACAGCGATGT TAGTATCACA GATGTAGATG ATAGCAACAT ACAGTCAGCC ACAATAACAT TATCGAACAT ACAAGACGGA GCATCAGAAA GCTTATCCGC CGGTACACTG CCAACAGGAA TAACAGCGAG CAGTTATGAC AGTAGTACAG GAGTCATAAC ACTGACAGGT AGCGCTACAT TAACTGACTA TCAGACAGCC ATAGCCCAAA TACAATATAA CAACACCTCG GAAAACCCAG ACACCAGTGC TCGGAGCGTG ACCGTAGTGG TGAACGATGG TAATGTTGAT AGTAACACAG CCACAACAAC AATAAATATT ACTCCAGTCA ATGACGCTCC CGTATTAGAC TTAGATGGGG ATAACACAGG GAATGACTAC ACAACAACTT TCACAGAAGG CCTAGGAGCA GTAGCTATAG GAAACAACGT TGGTATAGCA GATGAAGATG ATACCAACAT AGAGTCAGCC ACAATAACAT TAGGCAGTAG ACCAGATGGA GATACAGTAG AAAGCTTATT AGTCAACGGT ACACTACCAA CAGGAATAAC AGCGAGCAGT TATAACAGTA CTACAGGAGT CATAACACTC ACAGGCAGCG CTACATTAGC TGAGTATCAG ACAGCCATAG CCCAAATACA ATATAACAAC ACCTCGCAAA ACCCAGACCC CACGGATCGG ACCGTGACCG TAGTGGTGAA CGATGGTGAT GCTAATAGTA ACACAGCCAC AACAACAATA AGTCTTGTTC CAGTCAATGA CCCAGTCCAC 
TTTGATTTTA ATGCTGATGG AGTGGCAGAC ATTCTCTGGC GTCATAAAAG TCTCCAAAAT GGACCTAACA GGATCTGGTT GATGAAGAAT GACGGCACAC GGGATAGTAT CGTTAACCCT GGATCTTTTG GTTCAAATTG GAATGTAGAA AGAGTGGGAG ATTTCAATGC AGATGGAGTG GCAGACATTC TCTGGCGTCA TCAAAGTCTC TCATCTGGAC CTAACAGGAT CTGGTTGATG AAGAATGACG GCACACGGGA TAGTATCGTT AACCCTGGAT CTTTTAATTC AAATTGGAAT GTAGAAGAAG TGGGAGATTT CAATGCAGAC GGAGTGGATG ACATTCTCTG GCGTCATAAA AGTCTCCAAA ATGGACCTAA CAGGATCTGG TTGATGAAGA ATGACGGCAC ACCCGATAGT ATCGTTAACC CTGGATTTTT TGGTTCAAGT TGGAATGTAG AAGAAGTGGG AGATTTCAAT GCAGATGGAG TGGCAGACAT TCTCTGGCGT CATAAAAGTC TCCCACATGG ACCTAACAGG ATCTGGTTGA TGAAGAATGA CGGCACACCC GATAGTATCG TTAACCCTGG ATTTTTTAAT TCAAATTGGA ATGTAGAAGA ATTGGGAGAT TTCAATGCAG ATGGAGTGGA TGACATTCTC TGGCGTCATA AAAGTCTCTC ACATGGACCT AACAGGATCT GGTTGATGAA GAATGACGGC ACACCCGATA GTATCGTTAA CCCTGGATTT TTTAATTCAA ATTGGAATGT AGAAGGAGTG AGAGATTTCA ATGCAGATGG AGTGGATGAC ATTCTCTGGC GTCATCAAAG TCTCCCAAAT GGACCTAACA AGATCTGGTT GATGGAGAAT GACGGCACAC GGGATAGTAT CGTTAACCCT GGATCTTTTA ATTCAAATTG GGATATAGCT GGAATGTAA
|
Protein sequence | MVNDGNVDSN TATTTINITP VNDAPVLDLD GNNSSTATGS DYITTFTEGG GAVAIGDSDV SITDADDINI ESATITLSSR PDGDTVESLL VNGTLPTGIT ASSYDSSTGV ITLTGSATLA DYQTAIAQIQ YNNTSENPDT SARSVTVVVN DGNFDSNTAT TTINITPVND APVLDLDGDN TGNDYTTTFT EGLGAVAIGN NVGIADEDDT NIESATITLG SRPDGDTVES LLVNGTLPTG ITASSYNSTT GVITLTGSAT LAEYQTAIAQ IQYNNTSDNP NTTDRTVTVV VNDGDADSST ATTTINMTPV NDAPVLDLDG DNSTTTGSDY ITTFTEGTAV NIGDSDVSIT DVDDSNIQSA TITLSNIQDG ASESLSAGTL PTGITASSYD SSTGVITLTG SATLTDYQTA IAQIQYNNTS ENPDTSARSV TVVVNDGNVD SNTATTTINI TPVNDAPVLD LDGDNTGNDY TTTFTEGLGA VAIGNNVGIA DEDDTNIESA TITLGSRPDG DTVESLLVNG TLPTGITASS YNSTTGVITL TGSATLAEYQ TAIAQIQYNN TSQNPDPTDR TVTVVVNDGD ANSNTATTTI SLVPVNDPVH FDFNADGVAD ILWRHKSLQN GPNRIWLMKN DGTRDSIVNP GSFGSNWNVE RVGDFNADGV ADILWRHQSL SSGPNRIWLM KNDGTRDSIV NPGSFNSNWN VEEVGDFNAD GVDDILWRHK SLQNGPNRIW LMKNDGTPDS IVNPGFFGSS WNVEEVGDFN ADGVADILWR HKSLPHGPNR IWLMKNDGTP DSIVNPGFFN SNWNVEELGD FNADGVDDIL WRHKSLSHGP NRIWLMKNDG TPDSIVNPGF FNSNWNVEGV RDFNADGVDD ILWRHQSLPN GPNKIWLMEN DGTRDSIVNP GSFNSNWDIA GM
|
| |
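The gene sequence begins with GTG while the protein sequence begins with M. Under translation table 11 (bacterial, archaeal and plant plastid code), GTG is an accepted initiation codon and is read as methionine at the start position, although it encodes valine internally. The sketch below illustrates this on the first 30 bases of the gene sequence. The `translate` helper and the `START_CODONS` set are illustrative assumptions, not part of the record, and the set lists only a subset of the table 11 initiation codons.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); table 11 uses the same
# amino acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a subset of the initiation codons allowed by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna: str) -> str:
    """Translate a CDS, reading an alternative start codon as Met."""
    seq = "".join(dna.split()).upper()  # drop the 10-base block spacing
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # initiator codon is translated as methionine
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


print(translate("GTGGTGAACG ATGGTAATGT TGATAGTAAC"))  # MVNDGNVDSN
```

The output matches the first ten residues of the protein sequence listed in the record. Note that the second GTG codon is translated as V, since only the first codon is treated as an initiator.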