Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3993 |
Symbol | |
ID | 5735854 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 5093823 |
End bp | 5096441 |
Gene Length | 2619 bp |
Protein Length | 872 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641281143 |
Product | hypothetical protein |
Protein accession | YP_001546753 |
Protein GI | 159900506 |
COG category | [S] Function unknown |
COG ID | [COG1572] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGAACAC GTCTGCTGCG GTGCTGCTTG CTGATCGGTC TATTGTTAAG TGCCATGCTT CCGCCAATCG TGGAGGCAAC ACCGAAAGCA GCGCCATTGG TGGCCATTTA TATTTTGGCC TACGATAATC GGCTTGATAG CACGATGAAT TTAACCCCGT ATTATGATGC AACCCTCACT AGCATCACTA ATGCCACTGT TGGTCAGCCC GATCTAACGG CGATTGTGTT GGCTGATTTG GCGGGAATGA ATGATACCCA TGTGCGGGTT GTGCAGAATG GCAATGTCAA TACCTTGATT GGCTTGCCCG ATATTGATGG AATTATTGAT AGCAACCTTA AAGAATATGA TGTAACTGAT GGCCGAACCT TGGGCGGGTT TTTATTGTGG GCCAAAAGCA GTTACACTGG CCAAAACTAC ACCTTGAGCT ACATTGGCCA TGGTGTGCCA ATTATGCCCG ATATCGAGAT TTCAAACCTC AGCCAGCCCG AACGCCCAGT CAGCAGCGTC AATCTGCCAC CGTTGCCCAC CCGCATTGGG GCCAATGCCG ATGTGACCGA TCACACGCTG ACCACGAGTA TCAGCGGCTA TAATGCGCTT TCGCCCAATG ATTTGGCCTT GGCTTTGGCA ATTGCGGCTC CGATTGGCCC TCGCTTGGCC GTGCTCGATG TGTTGATGTG CTTTAGTGGC TCGATTGAGG CCTTATACCC ACTTGCACCG TATGCCGAAT ATCTGACTGC CTCGCCCAAC TATGCCTTTT TCGACCCAAC CATGCCCGGT AATGCCTTGC TTGGCTTGAA CAGCAACCCC AACCCGCTAC AAATGGCCCG ACATATTCTC GATAGCTACC ATAATCAACT GCCAAGCAGC GATCATCCAC GGATTTTAAG CGTGATCGAT GCCGATCAAT TAAATAATCT CAAAACCACG TGGGATAGTG CTTCGAACGC CATCTATACC AATTTGCTCA ATCCCAGCCA GCGCGAAGTT ACCCGCACAG CCTTGTTTAA TGCCTACCTT GAGAGCCGAA AATACGATCT GACCTATTGT GAGCCAAGCG ATTGGGAGCT GAATGCGCCC GATGGCTTGG TCGATTTGCG CAGTTTTGCT CATGGTTTGA GTCAAAGTTT CGCCAATCTC AATCCGCAAG TTGCTAGCTT TGCCGCCCAA ACCCGCGATC GGATTCGCAA TAATAGCGGT AATCCGATGG TCGTGGTTTA TCGCTTGGCG GATAACGATT TTCCGTGGTT CGACCCAACT CCAACCCAAT GGATTTTTGA TGGACTGAAT CCACTTGGGC TTGATGATGA TGCCGCTGGC TTGAGTTTAT ATGCCGATTT GCAAGGCCTC TCGGTGGCGG GAGCAACCGA ATTAAGCTGG CAGGCCCATT GGTATCACGA TGACGACACC CAGCCAGATA ATCCGCATCC GTTAGCATTT TTAGCCGATG TGACCCATCG CAACGGCTGG GATGAAGTGT TTCAGGAATA TTGGCGTGAT ATTGAGGTAC AAACAGCGTT GTGTACGCCG AGCTTACCTG CTGCCCGCGA TCAATCGACT CCTCGCGCCG ACATCAGCCT AAGCCAATTT AACCCTGCCG ATAGCAACTT GGCGGTCAAT GAATCGATTC GTTTGAGTGT TATGCTGAAT GTCACTCGCG CAGTGCAACG TAGCGATCTC TGCTTTGAGG TTGTCTTGAA TGGTACAGTC GTATTTACTG ATAGTTTGAT GTTGACCAAA CTTGAGGCTG GTAGCCAACG AATCTATGCG CAAAAAATTT GGCAACCAAC GACTGCTGGA GTTTATAGCT TGCGGGTCGT TGGCGATGGC GGCCAGCATG TGCAAGAAAG CAACGAAAAT AATAATGTGC TTACCCGCTC GCTGAATATT GCGCCAACCG TGCCGCGTCG CCCAATGCTT ATTGTCAAGA CCTCGAACAA TCTGCAATTA TCCAATAGCC CAACCGTTAC GCTGAATGTT CAGCAGCAAG CGGGCAGCGG TACACCAGTT TCAAGGGTGA TTATCCAAGC CTATCAATAT CAGGGCAATG CTGCCAACCC GCGCTTGCAA ACGCCAGTGT TGCGGGCTAC TACCACGATC AATCAACCAA CTTTACCAAC AGTGCAATTA AGCGTAGCAG GCTTAGATCC TGGTGCGGTG GTCTTGTATG TTTGGGGCTA TTCGAGCAAT GGCTATAGCT TGATTCCGGC CATTGTGCGG CTAAATTATG CGCCGTTGCC CGCAACGATC AGCCAAAACC AAAAACATAT CTATCGATTT AGCCTCAAGC GCGACCAAGC CCAAGCCTTT CGTTTACAAA GCCAGTTTGG CAATAGCAAT TTGCACGCAT GGGAACCATA TATCTGGACT GCACCAACTC AACAATCAAC CAGCCTTGGA TTGGATCAAA TTAGCTATAA TCCTACACCG CTAGCTGGCG AATACATCGT GCAAGTAAGT AGTAGTGAGG CTGGTCGCTA TCTGTTTACG GCGATACCTA ATCCGCCAGC AGGTCGCAAC TTTGAAACGA TCACCAGTAA ACAACCACGG CCAATCTTTG AGGAGCCAAT CCCATTCTTG CCAATCGACC AACTGTTTAT CCCAATAGTC CAACGTTAA
|
Protein sequence | MGTRLLRCCL LIGLLLSAML PPIVEATPKA APLVAIYILA YDNRLDSTMN LTPYYDATLT SITNATVGQP DLTAIVLADL AGMNDTHVRV VQNGNVNTLI GLPDIDGIID SNLKEYDVTD GRTLGGFLLW AKSSYTGQNY TLSYIGHGVP IMPDIEISNL SQPERPVSSV NLPPLPTRIG ANADVTDHTL TTSISGYNAL SPNDLALALA IAAPIGPRLA VLDVLMCFSG SIEALYPLAP YAEYLTASPN YAFFDPTMPG NALLGLNSNP NPLQMARHIL DSYHNQLPSS DHPRILSVID ADQLNNLKTT WDSASNAIYT NLLNPSQREV TRTALFNAYL ESRKYDLTYC EPSDWELNAP DGLVDLRSFA HGLSQSFANL NPQVASFAAQ TRDRIRNNSG NPMVVVYRLA DNDFPWFDPT PTQWIFDGLN PLGLDDDAAG LSLYADLQGL SVAGATELSW QAHWYHDDDT QPDNPHPLAF LADVTHRNGW DEVFQEYWRD IEVQTALCTP SLPAARDQST PRADISLSQF NPADSNLAVN ESIRLSVMLN VTRAVQRSDL CFEVVLNGTV VFTDSLMLTK LEAGSQRIYA QKIWQPTTAG VYSLRVVGDG GQHVQESNEN NNVLTRSLNI APTVPRRPML IVKTSNNLQL SNSPTVTLNV QQQAGSGTPV SRVIIQAYQY QGNAANPRLQ TPVLRATTTI NQPTLPTVQL SVAGLDPGAV VLYVWGYSSN GYSLIPAIVR LNYAPLPATI SQNQKHIYRF SLKRDQAQAF RLQSQFGNSN LHAWEPYIWT APTQQSTSLG LDQISYNPTP LAGEYIVQVS SSEAGRYLFT AIPNPPAGRN FETITSKQPR PIFEEPIPFL PIDQLFIPIV QR
|
| |