Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_1917 |
Symbol | |
ID | 4242666 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 2966800 |
End bp | 2970414 |
Gene Length | 3615 bp |
Protein Length | 1204 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 638107038 |
Product | von Willebrand factor, type A |
Protein accession | YP_721645 |
Protein GI | 113475584 |
COG category | [R] General function prediction only |
COG ID | [COG4248] Uncharacterized protein with protein kinase and helix-hairpin-helix DNA-binding domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.137165 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTAGTG AGTTTGAAAA GGTTCTAGAC GAGATTAGAA AACGAAGTAG TAAATATGAA GATATTGGAG ATGTTAAGAG GGAAATTATT AATGCGCTTT CACAAAGATA TAAAACACAG GAACGTTTGT TTACACATTA TGTTCTAGAA AAAGGGGGAT TTGATATTCG TAAATTAACT TTAAGTGACC CTCTGCTAGT AATTATATCT GATTTTTTTA ATCAGTGTAA TGGTCTAAAC AAAAACGAAT TATATAAATT TTTTAGAGTG TTATATGATG AAAATAAAGG TATTAGTTTG CTAAAAGAAA TTAGAGATAC ACTTGTACAA TTTAAACATC AAAGTACTGA TACTCCTACA TATACACCAT CACCATACAT ATCAGTTATT ACTTCAGAAG CAATAGGCTT AGTAGAAGCA AAATATGGAC AGTCTCTAGA ACAGGGAAAA AACTTTGCTA AAAAAGTTGA TTGTAATAAT TTTCAGGATG TACAAGCTTT TTTTAATAAT GGGGGACAAG CTGGTAAACA GCTTGCTATT TTGGAGCCAG GAACCTATTA CATTAACCCC GAAATATTCA CTATTCGTAC AGTTCCTATT ATTAGGATTC TTCAAGGAGA AATAGGACTG GTAATAGCCA ATGAGGGGAC TTTTAAGTCT GATGAACAAA CGTTAGGTAG AGTGGTGGAA TGTGATAACT TTCAGGATGC TGAAGCATTT CTGAAAAATG GAGGACAAAA AGGTAAGCAA CTTGCTATTT TGACTGACGG TGACTATAAA ATTAACACTG ATTTCTTTAG CGTAATTACT ACTACTAATG CCTATAAATA TAATGAAAAT CCAAATAATT TAAAGGTTTA TAAAATCGAC AAAGATAAAA TTGGTATTTT AACTACGATG GTTGGGAAAA TTCTCCCTAA AGGTGAAATT GCTGGACCAA TAATTGAAGG ACACGATAAT TATCAAAATG CACAAAAATT CCTAGATTTA GGAGGATATA AAGGATTGCA AGAAGAAGTT CTCCAAGAAG GAGCCTGGAG TCTAAATCCT TGGTTTGTTG AAGTTGAGCA GGTACCACTA ACTAGAATTG AGCAAGAAGA GGTGGGAGTT GTTATATCTT TTGTTGGAAA AGAATATGAC AAAAATTATG ATCAACAGCC CATTTTTTAT GGTGCAGAAA AGTCTCTATA CCAATTAGTT CCAAAAGGGT ACAAAGGAGT TCAAAAAGAA CCTCTTACTG CTGGTCAATA TGCAATTAAT ACCAGAGTAA AAACGGTCAA ATTAGTGCCA ACTACCCAAA TTATACTGAA TTGGTCAGAT CAGAAAAAAC ATCCCTTAAA TTATGACTAT GAACTGAAAC AAATGAAGCT AATTTCTAGA GACCATTTTG AAATTTTTGT TCAGTTTACC CAAATTATTC GTATTGCTGC TGAGAATGCT CCTAAGATGA TTTGTATGGT TGGATATTAT ACAGGGGAAG ACAAAATATA TGTTACTGAT GATTCTGGGC AAGTTGTTAA GAAATATGCA GTTATTAGAA ATTTAGTTTC TCGTGTATTA ACAAAAGTTG TTTATAGTCA TTTTCAACAA GCTGCTACTG GTAAAACGGC AATACAGTTT CTAAACACAA GAGGTGACTC TCAAAAGGAG GCCGAAAATT ATATAAAAAC ACTTCTTGAA ATGATTGGTG TTGAAGGATG TGGCACTTTT ATGGTTGATA CAGTTAATCT ACCTCTAGTA GTTGATTCTT ACTTACAAGA GAAACAAAAG CAAGAGGCAA GAGAAGCTCA AGCAAAAGCT GCACCTGAAC GGCTTCTTGA ACCTGAATTT GTTGAGAATC CTGAACAACG GCTTCCTGAA CCTGAATTTG TTGAGAATCC TGAAAACCGT TGTCCCATTA TCCTCCTACT GGATACATCT TACTCAATGT CAGGAGAAGC TATTACTGAA TTAAATCAAG GAGTGAAAAT ATTTCAGGCA AGTGTAAAGG AAGATGAACT GGCTTCCTTA AGAGTAGAAA TAGCTGTCAT TACTTTTAAC AGTGAAATTG AAGTAGTTCA AGATTTTGTT ACTGTAGATA AATTTATTCC CAAAACATTA GAAGCATCAG GAGTAACGCA CATGGGAAAA GCTATTGAAA AAGCCCTAGA ATTATTAGAA AAGCGAAAAC AAGACTACAA AAATAGCGAT ATTCAATACT ATCGACCCTG GATCTTTCTA ATTACTGATG GGCAACCTAC TGATACTTGG CAAGATGCAG CAAAAAAAAT AGAAGAAGCT GAAACTAATA GAAAATTACT TTTTTTTGCT GTTGGGGTAA GAGATGCAGA TATGGAGACA TTAAGTGAAA TTTCTGTATG CCCTCCTAAA AAACTCAACG GCTTAGATTT TCAATCTTTG TTTAAATGGC TAAGTTTTTC ACTTCAGCAA GTTTCAGTTA GCAAGATAGG AGAAAAGAAT AGACTTCCTC CAACGAATGC ATGGGAAGAA ATAACTAGTA AAAATCAAAA TACTAAACAG ACTCAACAAA CTCAACAAAA GACAACAATT CCTAATTCTG ACCCTAATCC TGAACCAATT ATCCCTATCT TAGTTTTAGA TAGAGATATT TTTAAAGATC AACGAATTAA GAGTAATTCG GAGGGAGAAA TTTGGATAAC TCAAAAATAT CGCTACCGAA AGAAATACCT AATAAAAATT TATTATGAAG TTACACCAGC AAGGATAAAA AAGTTAGAAG TAATGGTAGC TTATAAACCG AAAAATTTTC ATGGTTCTCA ACAAGCGTGG GCTTGGCCTG AGTATTTACT AGCAGATAAA ACAGGAAAAA TTATCGGCTT TGTCATGGAA TTTATTGAAG ATAGTAAACT GCTATTTAAT ATTTATAATC CTCAGCGTCG TAAGCAAATA AATAGTCAAC TCCACTGGTC AGTAGACTGG CTTTTTCTTC ACCATACTGC TAAAAATATT GCTACTATTA TTCAGTCTCT TCATAGTCAG GATTATGTTA TTGGAGATAT GAAGCCACAA AATATTCTAG TTAACCGATA TGCTTCTGCT TCAATAATTA ATACAGACTC ATTTCAAGTT CGCCATCCTC AGACAAAAGA AATCTATCAT TGTTTAGTTG GTTCCGAAGA ATTTACCCCT CCTGAACTAT TAGAAAAAGA ATTAGCAAAA ATTGTTCAAA CTCCTACCCA TGATAACTTT AGATTAGCCC TTATTATCTA TCATTTATTA TTTGGAGGAC ATCCCTTTAA AGGAAGGTGG ATAGGAACAG AAGAGCCACC CAAAATTGAT GAACTCATCC GACTAGGTTT CTGGTGCTAT GCTCCCAATA GTAAAATTCT ACCAGGACCG AGAACTATTC CCCTTGAAAT AGTTCACCCC AAAATTCAAA AATGCTTCCA AAAATGCTTC AACGATGGAC ACTATCATCC AGAAAAACGA CCCACTCCTC AGAACTGGGT TGACGCTTTA GAGAGTGCCA TTAATGATTT AGTACAGTGT AAAAGAGTTG ATACCCATTG GTATAGTAAA ACTTATGGCA AATGCTATTG GTGCGAAAGA GAAGAAAAAT TAGAGGTTGA CATATTTTCT GATTCCAAAA CATAA
|
Protein sequence | MPSEFEKVLD EIRKRSSKYE DIGDVKREII NALSQRYKTQ ERLFTHYVLE KGGFDIRKLT LSDPLLVIIS DFFNQCNGLN KNELYKFFRV LYDENKGISL LKEIRDTLVQ FKHQSTDTPT YTPSPYISVI TSEAIGLVEA KYGQSLEQGK NFAKKVDCNN FQDVQAFFNN GGQAGKQLAI LEPGTYYINP EIFTIRTVPI IRILQGEIGL VIANEGTFKS DEQTLGRVVE CDNFQDAEAF LKNGGQKGKQ LAILTDGDYK INTDFFSVIT TTNAYKYNEN PNNLKVYKID KDKIGILTTM VGKILPKGEI AGPIIEGHDN YQNAQKFLDL GGYKGLQEEV LQEGAWSLNP WFVEVEQVPL TRIEQEEVGV VISFVGKEYD KNYDQQPIFY GAEKSLYQLV PKGYKGVQKE PLTAGQYAIN TRVKTVKLVP TTQIILNWSD QKKHPLNYDY ELKQMKLISR DHFEIFVQFT QIIRIAAENA PKMICMVGYY TGEDKIYVTD DSGQVVKKYA VIRNLVSRVL TKVVYSHFQQ AATGKTAIQF LNTRGDSQKE AENYIKTLLE MIGVEGCGTF MVDTVNLPLV VDSYLQEKQK QEAREAQAKA APERLLEPEF VENPEQRLPE PEFVENPENR CPIILLLDTS YSMSGEAITE LNQGVKIFQA SVKEDELASL RVEIAVITFN SEIEVVQDFV TVDKFIPKTL EASGVTHMGK AIEKALELLE KRKQDYKNSD IQYYRPWIFL ITDGQPTDTW QDAAKKIEEA ETNRKLLFFA VGVRDADMET LSEISVCPPK KLNGLDFQSL FKWLSFSLQQ VSVSKIGEKN RLPPTNAWEE ITSKNQNTKQ TQQTQQKTTI PNSDPNPEPI IPILVLDRDI FKDQRIKSNS EGEIWITQKY RYRKKYLIKI YYEVTPARIK KLEVMVAYKP KNFHGSQQAW AWPEYLLADK TGKIIGFVME FIEDSKLLFN IYNPQRRKQI NSQLHWSVDW LFLHHTAKNI ATIIQSLHSQ DYVIGDMKPQ NILVNRYASA SIINTDSFQV RHPQTKEIYH CLVGSEEFTP PELLEKELAK IVQTPTHDNF RLALIIYHLL FGGHPFKGRW IGTEEPPKID ELIRLGFWCY APNSKILPGP RTIPLEIVHP KIQKCFQKCF NDGHYHPEKR PTPQNWVDAL ESAINDLVQC KRVDTHWYSK TYGKCYWCER EEKLEVDIFS DSKT
|
| |