Gene Information |
Locus tag | Haur_4658 |
Symbol | |
ID | 5736505 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 5952724 |
End bp | 5955108 |
Gene Length | 2385 bp |
Protein Length | 794 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641281822 |
Product | mucin 2 |
Protein accession | YP_001547417 |
Protein GI | 159901170 |
COG category | [R] General function prediction only |
COG ID | [COG3889] Predicted solute binding protein |
TIGRFAM ID | |
| |
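The coordinates and lengths reported above are mutually consistent if the start and end positions are read as 1-based and inclusive, and if the final codon is a stop codon that is not counted in the protein length. Both readings are assumptions here; the record does not state them. The function names below are hypothetical and are used only for illustration. A minimal sketch of the arithmetic:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record for Haur_4658.
length_bp = gene_length(5952724, 5955108)
print(length_bp)                  # 2385
print(protein_length(length_bp))  # 794
```

Under these assumptions the computed values match the "Gene Length" and "Protein Length" rows.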
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0296531 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGCTGCT CATCATTAAT GGTTGTGCGC TGGACAACTC CGGTTAATGC TTCTGGGCCT GAGGTCTTGC TGCCACCATT TGCCAATTAT GAGGCTTGGA ATGGCACAGC ACCATTTCCG ATTGGACAAG AAGCGATTCG TAGCAAAATT AGCGATCAGC CAGCTAACCC AAATTTAGTT GCAGCCGAAA CGCCGATTCC TATTGGGATT GTTGATTTGT TGCCAACCGC CACGCCAACT GGCCGACCTA CGCCAGCAAA TGAAGCAATT GTAACCTCAA CTACAACGAC TACCCCTACT CCCACCAAGG ATGAGGGTAT TCCGATTGGT AGTGCAACGG CGGCAACGAC CGAGCCAACT ATCGAATTAA CTGCCGAACC AAGCTCAACG CCACGGGTTA ATCCAAGCTC GACGGCTGGT ACGCGCACGC CAACCCTGAA TCCAACGATT AATCCAACGC GACCAACCTT TGAACCAACC TTTACGAGCA CGCCAACCCG AACGAGTGTT GTGGTGGCTA CTACGGTTGT GCCAGCCAGT GCAACCTCAA CCAATACTGC AATTCCAACG TCAGCGCCAA CAGCGACCAA CACGCGCGTG CCTACGATCA CGCCAGTGCC AGCGACGCTG ACCAACACGC CAACCGACGT AGCAACGATC ACACCAGTAC CAGCAACGCC CAGCAACACG CCAACCGACG TGCCAACCAA TACGCCTGTG ATCGTTACGG CGACCAATAC GCCAATTCCA ATTTTTACTA CGCCGAGCAA CACGCCAACC AATACGGCGA CCAATACGCC GAGTAATACG CCAACCAATA CGGCGACACC GACGAATACA CCGAGCAACA CGCCAACCGA TACGGTGACA CCAAGCAATA CGGCTACGCC ATCGAACACA CCAACGCCAA CCGATACGGC TACGCCAACT GCCACGCCAG CACCAGAGCT GTATATTGCT TGGCGCGTTG ATGCGATTGT CAATCCAGTT AATCCATCGA TGGAAAATAA TGATACCAAA CAGGTATTCG TGGTGTTTGG CAATGCTGGC GATGCTCCGG CATCAGGGGC GCAAGTTAAT ATTAGCGTAA CTGGAACTTG TATTAGCTCA AGTATTAGCA GCAGTGGCGT ACCAATTACG CTTGGGGCGC ACCAAGGCTT TACGCTTTCG CCCACAATTT CAGCCAATAA TGTTGGCAAT TGTTCGATTA CCGCAGTATT AACGGCGATT GGTCAAACCC CAGTTCAAGC AACCTTGAAT TGGACGATTG TGTGTGATGG TTGTGCTACT GTAACCCCGC AACCAACCAA CACGCCAACC CGTACACCAA CCAACACGCC AACCCGCACA CCAACACCAA CCAATACTGC GACTCCATCG AACACGCCAA CGCCAAGCAA CACGCCAACG GCGACCAATA CTCATACGCC AACGACAATT CCAACCTTGA CCTATACGCC AACGCCCAGC AACACGCCGA CGGTCACCAA CACGCCAACG CCCAGCAACA CACCAACCAA TACGCCAACG CCGAGCAACA CGCCGACGGT GACCAATACG CGCACACCAA CCAATACACC AACAATTACC AACACGCCAA CGCCGAGCAA CACGCCAACG GTGACCAACA CGCCAACGCC AACTAGTACG CCAACGCCAA CTAGTACGCC AACGGTGACC AATACGCCCG TTGATACACC AACGCCAAGC GAAACACCAA CGCCGAGTGA AACGCCAACA CCAACTAGTA CGCCAACCCC AACTCCAGAT CTGTTTGTGT TCCACATCGT CAATGGTGTG 
GTTAATCCAG CAGGCTCGAT GACATTGGCG GCTGGGGCGC AAGAAAATGT AACGGTTGTG TTTGGCAATA ATGCTAGCGG CTCGCGGGCA ACTGGCTTGA ATTTCAGTTT CAGTGGCGGG GCATGTATTA GCGCTCAGCC TGGCTCAAGC GATTCAAGCG ATTTGAATGG CGGCGTAAAT CGCTCGTTAT CAGTCATTGT GACGGGTAAT GCAGTTGGCT CATGTTCGTT CCGCACTCAA TTTAGTGCCA GCAACGCCAA TACCGTCAGT GTTGATAGTA GTTTTACCGT GGTCAATACC GCACTAAATC AACCAGCGAT TGCTGCAACT GCCACGGCAA CTTCAACTGC CACAGCTACG GCTACCGCTG AACCAACGGC AACTCTTGCG CCAACCGCTA TGCCAACCGA TCAACCAACA GCTGAGCCAA AAGCGACGCT CGAACCAACC TTGCCAGTAA TTGGTCAAAG CTCAGGGTTT CCACCAATGT CGGGTGGCCA AATGCTTTGG TTGCTGGCGG GCGGGCTAGC TATTCTGCTC AGTGGCTTGC GCGGGCGAAG AATCTTACCG CTTAACGTTG CCTAA
|
Protein sequence | MGCSSLMVVR WTTPVNASGP EVLLPPFANY EAWNGTAPFP IGQEAIRSKI SDQPANPNLV AAETPIPIGI VDLLPTATPT GRPTPANEAI VTSTTTTTPT PTKDEGIPIG SATAATTEPT IELTAEPSST PRVNPSSTAG TRTPTLNPTI NPTRPTFEPT FTSTPTRTSV VVATTVVPAS ATSTNTAIPT SAPTATNTRV PTITPVPATL TNTPTDVATI TPVPATPSNT PTDVPTNTPV IVTATNTPIP IFTTPSNTPT NTATNTPSNT PTNTATPTNT PSNTPTDTVT PSNTATPSNT PTPTDTATPT ATPAPELYIA WRVDAIVNPV NPSMENNDTK QVFVVFGNAG DAPASGAQVN ISVTGTCISS SISSSGVPIT LGAHQGFTLS PTISANNVGN CSITAVLTAI GQTPVQATLN WTIVCDGCAT VTPQPTNTPT RTPTNTPTRT PTPTNTATPS NTPTPSNTPT ATNTHTPTTI PTLTYTPTPS NTPTVTNTPT PSNTPTNTPT PSNTPTVTNT RTPTNTPTIT NTPTPSNTPT VTNTPTPTST PTPTSTPTVT NTPVDTPTPS ETPTPSETPT PTSTPTPTPD LFVFHIVNGV VNPAGSMTLA AGAQENVTVV FGNNASGSRA TGLNFSFSGG ACISAQPGSS DSSDLNGGVN RSLSVIVTGN AVGSCSFRTQ FSASNANTVS VDSSFTVVNT ALNQPAIAAT ATATSTATAT ATAEPTATLA PTAMPTDQPT AEPKATLEPT LPVIGQSSGF PPMSGGQMLW LLAGGLAILL SGLRGRRILP LNVA
|
| |
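The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table by hand rather than relying on an external library. Its amino acid assignments are those shared by the standard code and translation table 11. It ignores the alternative start codons that table 11 allows, which does not matter for this gene because it starts with ATG. The helpers `clean`, `translate`, and `gc_percent` are illustrative names, not part of any database tooling.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (first base varies slowest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in the record."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases and last 15 bases of the gene sequence above.
print(translate("ATGGGCTGCT CATCATTAAT GGTTGTGCGC"))  # MGCSSLMVVR
print(translate("CTTAACGTTG CCTAA"))                  # LNVA
```

The two fragments reproduce the first ten and last four residues of the protein sequence. Passing the full gene sequence to `gc_percent` would allow a comparison with the reported GC content of 53%.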