Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3163 |
Symbol | |
ID | 5735035 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 3993404 |
End bp | 3995056 |
Gene Length | 1653 bp |
Protein Length | 550 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641280306 |
Product | von Willebrand factor type A |
Protein accession | YP_001545928 |
Protein GI | 159899681 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG1240] Mg-chelatase subunit ChlD |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.181499 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAACGCT TCTTCCGTAG TTTTCTAATC GTTATTATGC TGGTGCTTGC AGCTTGTGGC GAGGCAGCCC AACCTACCAC CACCACTAAC CAAACCAGCG GGCCAGTTGT AGGCGCAAAT GATCTACTGA TTAGCATCAC CTACAGCCCA GAGAAAACCA AATGGCTTGA AGAACGCATC ACCACTTTCA ACAACCAAAA TGTGCAATCC AACGGCAAAC GGGTCATCGT TGAGGGCAAA GAGCTTTCCT CAGGCACAGC CCGCACTCAA ATTCGCGCTG GCCAACTTCA GACAACAATC TGGACACCTT CGGCCTCTAC CTGGCTGGAA GTGTTGAAGA AAGAAAGTAA TAATCCCACG ATTGCCGAAG CCGATCCTCA ATCGTTAGTG CTCACACCAG TGGTTATTTC GATGTGGAAA CCAATGGCCG AAGCCATGGG CTACCCGGGC AAAGAGGTAG GCTGGTCGGA TATGTTGGCG CTGATTCAGG ATACCGAGGG CTGGGGCAAA TTCGGTCAAC AAGATTGGGG CCGCTTCTCG TGGGGCCACA CCGACCCCGA CATTAGCACC ACCGCGCTTT CAACCGTGCT GGCTGAGTTG TATGCAGCCA ACGGCAAAAC CAGCGATTTG ACGGTCGAAG ACATCAATCA AGAAAAAAGC CAACAATTTT TGCGTGATTT AGCCCAAGGC ATCAAGCACT ATGGCTCAAA TACCTTGGTG TTCAGCCAAA ACATGCAAAA ATATGGCATG GCCTATATTT CAGCCTTTCC GATGGAAGAA ATTACCCTGA TTGATTTCAA CAAACAGGCT CCCAATGTCC CGTTAGTGGC AATCTATCCC AAAGAAGGCA CGTTTATTCA CGATAATCCC TTTATTGTGA TGAGCGATGC AACTGCCGAC CAAAAAGCTG CTGCCAGCGT TTTCTATGAT TTCTTGCTTA CGCCTGAAAG TCAAAATTTG GCCATGCAGC AAGGTTTTCG GCCAGCGAAC GTTGATGTAG CGTTGGCATC ACCATTAACT GCCCAATTCG GCGTAGATCC CAATCAACCA CGGAATTCGT TGGCAACCCC ACCAGCCGAT GTGATTGTGG CTGCCAAAAA TGCTTGGGCT AATAATCGCA AGCCCGCCAA TATTATGTTG GTGGTCGATA GCTCTGGCTC GATGCGCGAC GACGACAAGA TGGATCAAGC CAAACTTGGG GTTGAGGTGT TTCTCAATCG CTTGCCAAGC AAAGATAACG TCGGCATGAT CGGCTTCTCA TCAAGCCCAG CCGTGTTGGT GCCATTGGCA ACTCGTAGCG AAAACATGGC TAATTTGCAA ATGCAAACCC AAGGACTCGT GCCCGATGGC AACACCTCGC TCTACGATGC GATCGATTTG GCTCGTCAGG AATTGGAAAA CCTCAAACAA CCTGATCGGA TTAACGCGAT TGTGGTGCTA AGCGATGGTG CTGATACGGC CAGCCAGCTT TCAATCGATC AAATGCTCGG TAATTTTGGC GAATCGAGCA TTCAAATCTT CCCGATTGCC TATGGTGCTG ATGCTGAAAC TTCAATTTTG CAACAAATTG CCGATTTCTC CCGCACCGAG TTGGTTCAAG GTAGCACTGG CGATATTGAT AAAATCTTCG AAAATTTGAG CCGCTACTTC TAA
|
Protein sequence | MQRFFRSFLI VIMLVLAACG EAAQPTTTTN QTSGPVVGAN DLLISITYSP EKTKWLEERI TTFNNQNVQS NGKRVIVEGK ELSSGTARTQ IRAGQLQTTI WTPSASTWLE VLKKESNNPT IAEADPQSLV LTPVVISMWK PMAEAMGYPG KEVGWSDMLA LIQDTEGWGK FGQQDWGRFS WGHTDPDIST TALSTVLAEL YAANGKTSDL TVEDINQEKS QQFLRDLAQG IKHYGSNTLV FSQNMQKYGM AYISAFPMEE ITLIDFNKQA PNVPLVAIYP KEGTFIHDNP FIVMSDATAD QKAAASVFYD FLLTPESQNL AMQQGFRPAN VDVALASPLT AQFGVDPNQP RNSLATPPAD VIVAAKNAWA NNRKPANIML VVDSSGSMRD DDKMDQAKLG VEVFLNRLPS KDNVGMIGFS SSPAVLVPLA TRSENMANLQ MQTQGLVPDG NTSLYDAIDL ARQELENLKQ PDRINAIVVL SDGADTASQL SIDQMLGNFG ESSIQIFPIA YGADAETSIL QQIADFSRTE LVQGSTGDID KIFENLSRYF
|
| |