Gene Information |
Locus tag | Tery_3332 |
Symbol | |
ID | 4243503 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 5111276 |
End bp | 5112709 |
Gene Length | 1434 bp |
Protein Length | 477 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 638108317 |
Product | von Willebrand factor, type A |
Protein accession | YP_722908 |
Protein GI | 113476847 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG1240] Mg-chelatase subunit ChlD |
TIGRFAM ID | |
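
The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive, and that the last codon is a stop codon that is not counted in the protein length (the gene sequence in this record ends in TAA). The function names are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(5111276, 5112709)
print(length, protein_length(length))
```

With the values from this record, the result agrees with the listed gene length of 1434 bp and protein length of 477 aa.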
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0350677 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACTGGC AAAACTTGAC TCAAAAACTC CCTAAACCAA TTATCTTCGG AATATTCGGC GGTGCTGGAT GTCTTATAGC TGCGGCAATA TTTGGTGAAA TGTGGCTGTC TCTGACCAGG CGACCTCCCC AACCTCAAAC TGTCGTTCTC CTCATCGACA CTTCTTCTAG CATGTGGGGT GGTAAACTTC CAGAAGTCCA AGCAGCAGCT ACCGGATTCG TTGAACGACA AAATTTAACT GTTAATAACT TAGCCATTGT AGAATTTTCC AGCAACTCGC AAGTTCTGAC CAATTTTGAT GCTGATAAAA CTGAACTCAA ACAAGCGATC GCTAATCTCA CCCCATCTGG AGGTACAAAC CTTTCTCAAG GCCTCAAAAC AGTCGCTTCT CTTCTGCGAA ACAGCAACAC TCCCAATATT CTCCTATTTA CAGATGGTCA ACCCAACGAC CCTAGGGCCT CAAAATCAAT AGCTAGAGAA ATCCGAGAGG CAGGAATTAA TTTAGTCACA GTGGGAACCG GAGATGCAAA CAGTAACTAT CTCACTTCCT TGACAGAAAA TCCAGACCTA GTCTTTTTTG CTAACTCTGG AGAAATAGAC CAAGCTTTCC GAGCTGCTGA AAAAGCCATC TCACAACTAT CTGACACAAG TGGTAATTAC GGCTTAGTCT TCGGTATTTT CCGCATAGGG GCATGGACCG GTTTTCTGGC TCTCGGAATT GGACTGGCCT TAATCCTTGG ACAAAACTAT AACCTCCGCC GTCGGTTGTT GTCGAAGCAA GAAGTTGCTC TTGGAGGTGG GGGTGGTTTT CTCGCTGGAG TAGTAGGTGG AGCGATCGGT CAATTGGCGC TTCTGTCAAG TACTAATCTC CCGACTTTAG CGATCGTAGC TCGAATGACC GGCTGGACTT TTCTCGGAAC CCTTGTTGGT GGTGGAACGT CTTTATTTGT TCCTAACCTA CCTCGTGAAA AAGCCTTGAT CGCCGGAGGG TTAGGAGGTG TGTTAGGAGC GACTTGCTTT CTCTTGCTCA ATGCATTGGT AGGTGTGCTT CCAGCTCGTT TGGTAGGAGC AGGAATTTTA GGATTTTGCA TTGGGTTGGC TATCGCTTTT AGTGAACAAC TAGACCGGGA GGTAGTATTG TTGGTTCGCT GGAACAACTC AGAATTTACA ACTATTTCCT TGGGAAAGGA ACCCATTGAA CTTGGTAGCT CCCGGAATGC TCATATTTAT CTATCAAGAG ATGCTGGTTT TCCCGCTAAG TTTGCTAAGA TATTTATTGA AGAAGAAAAA ATTATTTTAG AATTTGACCC GTCAATTAGA GAGCGCCCGA AGTTTCAAAA TATGAAAGTT TTGAAACAGG AACTTTCATA TGGCTCAAGT CGTAAATTCG GAGATGTTTT ATTAGAAATT CCACAAAAAA ACATACTAAA ATAA
|
Protein sequence | MNWQNLTQKL PKPIIFGIFG GAGCLIAAAI FGEMWLSLTR RPPQPQTVVL LIDTSSSMWG GKLPEVQAAA TGFVERQNLT VNNLAIVEFS SNSQVLTNFD ADKTELKQAI ANLTPSGGTN LSQGLKTVAS LLRNSNTPNI LLFTDGQPND PRASKSIARE IREAGINLVT VGTGDANSNY LTSLTENPDL VFFANSGEID QAFRAAEKAI SQLSDTSGNY GLVFGIFRIG AWTGFLALGI GLALILGQNY NLRRRLLSKQ EVALGGGGGF LAGVVGGAIG QLALLSSTNL PTLAIVARMT GWTFLGTLVG GGTSLFVPNL PREKALIAGG LGGVLGATCF LLLNALVGVL PARLVGAGIL GFCIGLAIAF SEQLDREVVL LVRWNNSEFT TISLGKEPIE LGSSRNAHIY LSRDAGFPAK FAKIFIEEEK IILEFDPSIR ERPKFQNMKV LKQELSYGSS RKFGDVLLEI PQKNILK
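
Both sequences above are printed in space-separated blocks of 10 characters. As a rough sketch of how such a field could be turned back into a contiguous string and summarized, the snippet below joins the blocks and computes a GC fraction. The helper names are hypothetical, and the sample is only the first two blocks of the gene sequence, not the full record.

```python
def join_blocks(blocked: str) -> str:
    """Remove the whitespace between 10-character blocks and uppercase the result."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two blocks of the gene sequence in this record, used as a short sample.
sample = "ATGAACTGGC AAAACTTGAC"
seq = join_blocks(sample)
print(len(seq), gc_fraction(seq))
```

Applying the same helpers to the full gene sequence field would give a value to compare against the GC content listed in the record.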