Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hhal_1043 |
Symbol | |
ID | 4709797 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | - |
Start bp | 1127722 |
End bp | 1130067 |
Gene Length | 2346 bp |
Protein Length | 781 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 639855514 |
Product | von Willebrand factor, type A |
Protein accession | YP_001002621 |
Protein GI | 121997834 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG4548] Nitric oxide reductase activation protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGGTGC GTCTCGAGGA CTACCGCGAG ACCCTGGTGG CGGCGGATCC GCGGATCGCC GAGACCCTGG AGGCGAGCTT CGCCGAGGCG TCGCGGGTGA TGTCGCCGCG GGGGTTGCAC AACTACCTGG AGGGTGCGCG GGCCCTGGCC GAACTCGGCC GCGGTGCCGA CCTGGTCATC GGTTATCTGG AGGCGATGCC GGCGGTGGCC AAGGAGGTCG GTGAGGACGT CCTGCCCGAG GTGGTCACCG CCGCCATGAA GCTCTCCTCC ATGGTCAGCG GCCAGGTGAT CGCCCTGCTG CTGGCCGGGC TGCCCACGGC CGCGCGGCGG CTCGGCGATC CGGACTTGCT GCGCCAGTTC CTCAACCTGG TCCATCAGCT CTCGGCCAAG GCCCCGCGCG GCCTGCGGCC GATGCTCGAG AACCTCGACG AGCTCTTCAC CAAGCTGACC CTCGGCGGAT TCCGACGCTG GGCGCTGTGG GGGGCCCAGG CCCACGCCCG CGACTTCGAG CAGCAGCGGG GTTACTTTGG CCTGCAGACC GCCGACAGCC AGGCGATGCT CCAGCAGGAG CGCCGCGGCA CCCTGCTGGT GGACAACCAG CGCAAACTCA ACTTCTATCT GCGGGCCCTG TGGGCGCGTG ACTTCTTCAT GCGCCCGACC TCGGGCGACT TCGAGACCCG CGAGGGGTAC AAGCCGTTCA TCGAGCAGCG GGTGATCCAC CTGCCCGATG CCTACGACGA TTACCACGGA CTGCCCGGCA AGGAGCTCTA CCGAGCCGCG GCCGCCCACG CCGCCGCGCA CCTGTTCTAC ACCACGGAGC CCTTGTCGCC GGAGATGCTC AATCCGGCGC AGATGGCCGT GATCGGTCTG ATCGAGGATG CCCGGATCGA GGCGCTGGCC ATCCACGAGT TTCCGGGGCT GCAGCGACTC TGGCAGCCCT TCCACCAGGC CCGGGCCAGG GAGCGCGCCG GCGAGGACCC GGACCCGGTG GTCGAGCGCC TGGAACGGGC CGCCCACGCC CTCATCGACC CGGAGTACGC CGACGAGGAT CCCTGGGTGC AGCAGGTCCG CGAACTCTTC GCCTACCACT TCGCGGATCG GCCACGGGAT GTGGGGCTCT CCTGGGAGTT GGGCATGGAG CTGCACAACA GCCTGGCCCA GCGCTATTCC ATGCCTTCGG CCCGGGTCCT GGAGCGGGTC GCCGTTCCCT ACCGCGATGA CAATCGCTAT CTCTTCGCCA GCGACGAGGA CGAGTGGCTC GAGGCCGAAT ATGTCCCGGC CAGCCACCGT CAGGTGCGCC GCCAGGTCAA CCTGATGGAG TTCGTCAACG AGGTCGATTG CGAGCTCGCC GGCGACGACG CCCAGGAGGT CTGGGTCCTC GGCACGGAGC TCTTCCCCTA CGAGGACTAC GGGGTGAGCT ACAACGAGAT GGAAGGGGTG GAGCCGGTCA GCCCGCCGTT CCACTACCCG GAGTGGGATT ACCAGGTCCA GCTCAATCGC CCCGACTGGG TCACGGTGGT CGAGCGCCGT CCGAAACGCG GCGACCCCGA GGTCATGGAT CAGGTCCTCA AGGACTACCG CCCGGTGGCC AGTCGGCTGC GCTATCTCAT CGACGCCCTG CAGCCCCAGG GAGTGATCCG CGAGCGGCGG CAGGAGGACG GCGACGAGCT GGACATCGAC GCGGCGGTGC GCTCCATGGT GGATCTGCGC CTGGGCATGA GCCCGGATCC GCGCATCAAC ACCCGGTATA TCCGCAAGAC GCGGGATCTC TCGGTGCTGC TGCTGCTCGA CCTCTCCGAG TCCACCAACG ACCCGATGGG CGGTAGCGAG AAGACCGTCA TCGAGTTGAC CCGGGAGGCC ACGTCGTTGC TCGGCTGGGC CATCAACGGC ATCGGTGACC CCTTCGCCGT GCACGGCTTC TCTTCGGACG GTCGGCACGA TGTGCAGTAT TACCGGTTCA AGGACTTCCA TCAGCCGTGG GGCGAAGAGG CGAAGTCCCG CCTGGCCGGC ATGCGCGGGC AGTACTCGAC TCGCATGGGC GCGGCAATGC GCCACGCCGG GGCCCACTTG GTCCGCCAAC CGCAGCGACG CAAGCTGTTG CTGATTGTCA CCGACGGAGA GCCCCACGAC GTCGATGTGC GTGATCCGCA GTACCTGCGC CACGATGCCC GCAAGGCCGC CGAGGAGCTG TCCGCCCGGG GAGTCACCAG CTACTGCCTG ACCCTGGACC GGGATGCGGA CGCCTACGTG TCACGGATCT TCGGGGCCAA CGGGTACTCG GTGGTCGAGC AGGTCGAGCG CCTGCCGGAG CGTCTGCCGG CGGTCTTTGC CCAGCTGACC CGGTAG
|
Protein sequence | MTVRLEDYRE TLVAADPRIA ETLEASFAEA SRVMSPRGLH NYLEGARALA ELGRGADLVI GYLEAMPAVA KEVGEDVLPE VVTAAMKLSS MVSGQVIALL LAGLPTAARR LGDPDLLRQF LNLVHQLSAK APRGLRPMLE NLDELFTKLT LGGFRRWALW GAQAHARDFE QQRGYFGLQT ADSQAMLQQE RRGTLLVDNQ RKLNFYLRAL WARDFFMRPT SGDFETREGY KPFIEQRVIH LPDAYDDYHG LPGKELYRAA AAHAAAHLFY TTEPLSPEML NPAQMAVIGL IEDARIEALA IHEFPGLQRL WQPFHQARAR ERAGEDPDPV VERLERAAHA LIDPEYADED PWVQQVRELF AYHFADRPRD VGLSWELGME LHNSLAQRYS MPSARVLERV AVPYRDDNRY LFASDEDEWL EAEYVPASHR QVRRQVNLME FVNEVDCELA GDDAQEVWVL GTELFPYEDY GVSYNEMEGV EPVSPPFHYP EWDYQVQLNR PDWVTVVERR PKRGDPEVMD QVLKDYRPVA SRLRYLIDAL QPQGVIRERR QEDGDELDID AAVRSMVDLR LGMSPDPRIN TRYIRKTRDL SVLLLLDLSE STNDPMGGSE KTVIELTREA TSLLGWAING IGDPFAVHGF SSDGRHDVQY YRFKDFHQPW GEEAKSRLAG MRGQYSTRMG AAMRHAGAHL VRQPQRRKLL LIVTDGEPHD VDVRDPQYLR HDARKAAEEL SARGVTSYCL TLDRDADAYV SRIFGANGYS VVEQVERLPE RLPAVFAQLT R
|
| |