Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3752 |
Symbol | |
ID | 5735616 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 4716715 |
End bp | 4719036 |
Gene Length | 2322 bp |
Protein Length | 773 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641280904 |
Product | hypothetical protein |
Protein accession | YP_001546516 |
Protein GI | 159900269 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.000190776 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACGTG CGCGTTTATT AATGTTCGTG GTGCTAGCTG GCTTGTTAAC GCCGCTGGGT GTATGGGCGC GGCCTGTAAC GCCACCCCAA GCCCAGCCGC AAGCCCCGAA TACAGTCAAT TTTTTCGTAG ATCCTAATTC CTTTGCAGAT ACCATGGTTC CTGGGCAATC GCGCCAATTT GCATTTCAAA TTCAGAGCCT AAGTAGCTCT AATGAAAGTG CAAATTTTAC ATTTGCTCCA GAGGGATTAC CACCCCAAAT TACAGTTAAT GCAATTGCTC CTGTGACAGG GGTAAATCCA AGTGATCAGG TTACAGTTTT GGCTACAATT AATATTGCAG CTACTGCTCC CTTGCAAACG TATACATTTA GGATTCGTGT TACTGCTACT GGCAATACTA CAGGTGTTGC CAGTTCAAGA ATTGTTGATT TTCTTATTAA TGTCGTAGCG CCAACAGCAA CTCCAACTCG TACTAATACA CCTACCGCCA CTGCTCCAAC GGCTACCAAT ACTGGCACGC CTGGGCGGAT TTGCGATGGT AATGGTGGCA CGGTTAACGA TAAATTTGAG CCAAATAATA CCCGCAGCCA GGCTCGCCGA ATTGAAGTTG ATGTGCCGCA AGTGCACGCG ATTTGTCCAG TTGGCGACGA AGATTGGCTC TTATTTGGCG GTTTAGAGGG CAAAGTCTAT ACGATCGATG TTTCGCAAAT GGTCGATGGG CTGGATCTTT CGTTGACGCT TTACGATAGC AACGGCAATC AATTGGCCTT TAACGATGAC TTTCCGCGCA ACAATGATCC CAGCGATATC AAGCCACGAA TTCAATCGTG GCGTGCGCCC GCCAACGGCC AATACTACAT TAAGGTGCGC GATTCCGCCG GACGCGGCTT TATTGATGCG CTCTACACAG TGGTGTTGAA TAGTGAAAGT TATGGCCCCA CGCCAACGCT CATCCCTGAA ATTTGTAAAG ATCTCTACGA GCCGGATGGC TTGCCTGAAA TTGCACCGTT GATTGTGGTT GGCGAAGTTC ATCCCGACCA TCGGTTGTGC CCACGCGGTG ATGCCGATTG GGTCAAATTC TTTGGCAAAG CTGGCAAAGT CTATTCGATC TTTACCTCGG AACTCAGTGT TGGTGCTGAT ACAGTGATGG TCTTGGCTGA TCGCGATGGC ACAACGATTA TCGATTTCAA CGATGATTAT GAATCAGGCT TGGATTCGCG AATCGATTTC GCGCCGTTCG TCGATGGCTT CTATTTTGTG CAGGTCAAGA ATGTTGGCGA TGTTGGCAAT CAGTTTATCG ATTACACCTT GACCTTCCAG ATCAAAACTA ATGCCAACCA AGGCGAACCA ACCATGCAGC CAACCGCAAC CTTCGAAGAT GATATTACGC CAACTTTCGA GGATGATGTT ACGCCAACTA GCGATCCTAA TCGCACGGCA ACCGCGACTG GCACGGCGAC CCGCACGCCA ACTTCGGCCT ATCCAACCCC AACCACTTCA TCGAGCAACA AATTGCCCAA CTTCGATACG CGCAGCAATG GCAAATTTGC CGACCCAGCC TTTAACAAGG TTTGGGCTTA TGCCGATGCT CCAGTGGCGA GTGGCCAAGC AGTGCGCTCT TGGTTGTGGG GGCCGAGCAG CGGCCAAGCC CGCGCCGAGG TTTACGATCA AGCGCCTGGT GGTTTGCGTC AAGTGCAATA TTTCGATAAA TCGCGCATGG AAATTAGCGA TTTCGAGGCC GACCGCCAAA GCCAATGGTT TGTGACCAAC GGCTTGCTGG CGAAGGAATT GATTCAGGGC CAGATTCAAA TTGGCGATAG CAATTATGTG CAGCGTAGTC CGGCCCAAAT TAATATTGCT GGCGATTTAG GCGCTGCTTC GGCTCCAACC TACGCCAGTT TTAGCAATTT GCTTGGCGCA ACCAGCGATC GTACTGGTCA ATTCGCCGAT CAGCAATTAG CGCGTAGTGG CAAAGTGAGC GCTTATGCTG GCGCTGCAAC TGATGCAGCT AAGTTGGTGC ATTATGTGCC ACAAACTGGC CACAACATCC CTAGCGCCTT CTGGGATTTT GTCAATCGTC AAGGCTTGGT TAGCCAAAAT GGCCGTACCC AAAACGGCCA AGTGATGGAT TGGGTTTTTG CTTTGGGCTA CCCAATTAGC GAAGCCTACT GGGCCAAGGT CTATGTTGGT GGTGTTGAGC AAACTGTGTT GGTGCAAGCC TTCGAGCGCC GCGTGCTGAC CTACACTCCC AGCAATCCCG CCGATTGGCA AGTCGAAATG GGTAATGTCG GCCAACACTA CGAACAATGG CGCTACCGCT AG
|
Protein sequence | MKRARLLMFV VLAGLLTPLG VWARPVTPPQ AQPQAPNTVN FFVDPNSFAD TMVPGQSRQF AFQIQSLSSS NESANFTFAP EGLPPQITVN AIAPVTGVNP SDQVTVLATI NIAATAPLQT YTFRIRVTAT GNTTGVASSR IVDFLINVVA PTATPTRTNT PTATAPTATN TGTPGRICDG NGGTVNDKFE PNNTRSQARR IEVDVPQVHA ICPVGDEDWL LFGGLEGKVY TIDVSQMVDG LDLSLTLYDS NGNQLAFNDD FPRNNDPSDI KPRIQSWRAP ANGQYYIKVR DSAGRGFIDA LYTVVLNSES YGPTPTLIPE ICKDLYEPDG LPEIAPLIVV GEVHPDHRLC PRGDADWVKF FGKAGKVYSI FTSELSVGAD TVMVLADRDG TTIIDFNDDY ESGLDSRIDF APFVDGFYFV QVKNVGDVGN QFIDYTLTFQ IKTNANQGEP TMQPTATFED DITPTFEDDV TPTSDPNRTA TATGTATRTP TSAYPTPTTS SSNKLPNFDT RSNGKFADPA FNKVWAYADA PVASGQAVRS WLWGPSSGQA RAEVYDQAPG GLRQVQYFDK SRMEISDFEA DRQSQWFVTN GLLAKELIQG QIQIGDSNYV QRSPAQINIA GDLGAASAPT YASFSNLLGA TSDRTGQFAD QQLARSGKVS AYAGAATDAA KLVHYVPQTG HNIPSAFWDF VNRQGLVSQN GRTQNGQVMD WVFALGYPIS EAYWAKVYVG GVEQTVLVQA FERRVLTYTP SNPADWQVEM GNVGQHYEQW RYR
|
| |