Gene Information |
Locus tag | Haur_0075 |
Symbol | |
ID | 5731948 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 97351 |
End bp | 98661 |
Gene Length | 1311 bp |
Protein Length | 436 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641277197 |
Product | hypothetical protein |
Protein accession | YP_001542855 |
Protein GI | 159896608 |
COG category | [R] General function prediction only |
COG ID | [COG1721] Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.258844 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTCCAT CGCGCCGTTT ATTAGCACTT TTGCTAGCAG GCTTGGTTCC GGTGATCATC GGAGCAGCAA CCCCCAGTTT GCGCTGGACA ATCTGGGTTT ATGTATTGTT GCTGATTGGC TTTGTGGCGC TTGATTGGTT TATGACTCCC AAGCCTAAAT TGTTGGAAGT AGCGCGGATC AACGAGCCAA AACTTTCAAT TGGCGAACAA AACCTGATCA CGCTGGCAGT GCATAATCAA AGCCCGCGCA CGCTCGAAAT TCAAATTCGT GATGAGTTTC CGGTGGAGTT TCCCAGCGAT ACGCTGATTC TTAAAACCAA GGTCGAGCCA GATACGGTGC AAGAGGTTAA CTACCATGTA CGGCCCTTGC GGCGTGGTGA TTATCGCTTT GGCAATATCA ATTTGCGCTA TACCAGCACC TTTGGTACGT TCTTGCGCCA AACCAAAATC GCCTTCGACG AATTGGTCAA GGTTTATCCC AATGTGCTTG ATGTGCGCAA ATACGATATG TTGGCGCGGA AGGGCATGCT GTTTGAGTTG GGGTTGCGCA CCGCCCGCGT GTTTGGCTCA GGTACCGAGT TTGAGCGCTT GCGTGAATAT ACGCCCGATG ATGAGTTTCG CTCGATTAAC TGGAAGGCCA GCGCTCGCCG TAACAAGCTG ATTGCTGCCG AATATGAGAC CGAGCGTTCG CAGTATGTAG TGTCGGTGAT CGACACTGGG CGTTTGATGC GCCCAACAAT CAACGATATC GCCAAGCTGG ATTATGCGAT CAATGCCTCG TTAATGCTGG GGTATGTGGC GATGCTCAAA GGCGATCACA TCGGCATGCT TTCGTTTGCC GACCATGTTG GGCGTTTTTT GCAGCCGCGC CGTGGTAAAG CCCAGTTTTA TCAAATGTTG GAGATGTTGT ATAACTTGCC ATCGCAGCCC GTCGAGGCCG ATTATGGCCG CGCGATCTCC TACTTGGGCT TGAAGAATAA GCGCCGTTCG TTAATTGTGA TTTTTACCGA CCTCAGCACC ATGGATACCG CCAAGCCGCT GATTCAGCAT ATGGCACGCT TGGCCAAAAC CCACCTCGCC TTGTGTGTGG TGATGAGCGA CCCCAACTTA GTCGGCTATG CTGGCAAAGC GGCCTATAGT TCCACCGATG TGTATGAACG CGCCGTGGCC GAGATGGTGC TTGATGAACG GCGGGTAGTG CTCGACACGC TGAATCAAGC TGGCGTACAT ACGATCGACG TGCCAGCCAA CAAACTAACG GTTTCGGTGA TTAATAAATA TCTGGAGTTC AAAGGGCGAG GGCTTATTTA A
|
Protein sequence | MIPSRRLLAL LLAGLVPVII GAATPSLRWT IWVYVLLLIG FVALDWFMTP KPKLLEVARI NEPKLSIGEQ NLITLAVHNQ SPRTLEIQIR DEFPVEFPSD TLILKTKVEP DTVQEVNYHV RPLRRGDYRF GNINLRYTST FGTFLRQTKI AFDELVKVYP NVLDVRKYDM LARKGMLFEL GLRTARVFGS GTEFERLREY TPDDEFRSIN WKASARRNKL IAAEYETERS QYVVSVIDTG RLMRPTINDI AKLDYAINAS LMLGYVAMLK GDHIGMLSFA DHVGRFLQPR RGKAQFYQML EMLYNLPSQP VEADYGRAIS YLGLKNKRRS LIVIFTDLST MDTAKPLIQH MARLAKTHLA LCVVMSDPNL VGYAGKAAYS STDVYERAVA EMVLDERRVV LDTLNQAGVH TIDVPANKLT VSVINKYLEF KGRGLI
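The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is only an illustration, not part of the database: it assumes Start bp and End bp are 1-based and inclusive, and that the final codon is a stop codon (the listed gene sequence ends in TAA). The variable and function names are hypothetical.

```python
# Values copied from the record above.
start_bp = 97351
end_bp = 98661
gene_length_bp = 1311
protein_length_aa = 436

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# The gene is not spliced, so the CDS is read as consecutive codons of three bases.
codons, remainder = divmod(span, 3)

# Assumption: the last codon is a stop codon and contributes no residue.
residues = codons - 1


def record_is_consistent():
    """Return True if the coordinates agree with the listed gene and protein lengths."""
    return span == gene_length_bp and remainder == 0 and residues == protein_length_aa


print(span, codons, residues, record_is_consistent())
```

Under these assumptions the span is 1311 bp, which gives 437 codons and 436 residues, matching the Gene Length and Protein Length fields.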