Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3383 |
Symbol | |
ID | 5735244 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 4265585 |
End bp | 4267345 |
Gene Length | 1761 bp |
Protein Length | 586 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641280530 |
Product | hypothetical protein |
Protein accession | YP_001546147 |
Protein GI | 159899900 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0654007 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGACGAC GATGGGTTCG TCTTGGCCTG CTCATGCTTA TTGCCATGGG CGTAGTGCTA CCTGGAACCA AGGCCGAGGC TACACCGCCG CCGCCCTTAG CCCAGTATTT CCCCGAAACC GGGCAATCAG CGGTCAACTA TTTCTGGCAA TTTTGGAAGA ATACGCCGAA TGCCATGCGC GTGTTAGGCT ATCCCATTTC TTTGCCATTT GTCCAAGAAA GCTTTACCGA GCCTGGTAAA TTCTATTTGG TGCAATATTT CGAACGCGCC ATCTTAGAAG AACACCCTGA AAACTTCAAT CACCCAACCA ATGGCAACAA ATACTTTGTG CTTGGGCGGT TGCTGGGCAA AGAATTGGCC AAGGGTCGCG AAAATGAACC AGCCTTCAAG CCAGTGGCTA ACCCCAACAA TGGTACGGTT TGGTTCCCTG AAACCCAACA TACTCTGACC AACACGCCTG GCCCATTTTT GACCTTCTGG CGCAATTATG GTGGCCTCTC GGTGTTTGGC TATCCTTTGT CAGAGCCATT CCAAGAGCTG AACCCCGATA CTGGTAAAGT CTATTGGGTG CAATATTTCG AGCGCAATCG CTTCGAATAT CATCCCGAAG AAAAACCTGA ATTCCAAGTG TTGCTTGGTT TATTGGGTAA ACAATACTAT AACGAACACA AAACTGAGCC AAAATTGGCC GTCAAGGAAT GGTTCTTCCG CTATCACACC CGCGCCGAAG CCATCCCAGT CGATTTCATC TATGGCTACA ATGTTCAAGC CTTCTACCAA GAACGTGATC GCTTATACCA ATTGGTCAAC AACGCCGCCT TTGGCTGGAT TCGCCAACAA GCACCTTGGG AAGATCTCCA AGCTGCTGAC GGTACAATCT ACTGGGGTGA GCTGGATAAA GTTATCAACG ACGCTCATGC CAAGGGCATT AAAGTCTTGT TAAGTGTGGT TCGTTCGCCA GAATGGGCCA GCGAAAATGG CACGCACGGC TTGCCATCAC GCCGCAACTT CCCCAAATTT GGCGATTTTA TGCAGCGCAT GGCGCAACGC TACAAAGGCA AAGTCCAAGC TTACGAAATT TGGAACGAGC AAAATTATGC GATTGAAAAC GGTGGCGTGG TAGCTCCAGC AGCCTATTAT GTGGATATGC TGGAATATGC CTACAAAGGC GTGAAAGCTG CTGACCCCGA AGCAATTGTG GTTTCTGGCT CACCAACCCC AACTGCCACC AATCGCACCG ATATCGCGGT TGATGAGTTG ATTTACTTTG CTCAAATGTT TGCGATTCCT AAATTCTGGA ATAACGTTGA TGTAATTGGT GCGCACTTCG GCGGCACCTA CAATCCACCA GATACCAAGT GGCCCGATAA CCCAGGCCCA GGCCCAGGTT GGCGCGACAA CTCCGAATTC TACTGGCGAC GGATTGAAGA TGTGCGCCAA GTGCTGGTCA ATAGTGGCAA CGGCGACCGC CAAATTTGGG TAACCGAAAT TGGCTGGGCC ACCGCCAACA CCAGCCCAGG CTTTGAATAT GGCAACTCGA ATACAATCGA AGAACAAGGA GCTTACCTCG AACGAGCAAT GTATATGGCT CGCTACGATT ACGCTCCATG GGTTGGCGCA ATGTTCGTGT GGAACTTAAA CTTCGCGGTG ACCTCACCCG ATCCGCTGCA CGAAACTGCT TCATTCGGGG TATTGAACCC TGATTGGAGT CCACGCCCAG CCTATACCCG CTTGCAACAG TTTGCAGCAA CCCATAAATA A
|
Protein sequence | MRRRWVRLGL LMLIAMGVVL PGTKAEATPP PPLAQYFPET GQSAVNYFWQ FWKNTPNAMR VLGYPISLPF VQESFTEPGK FYLVQYFERA ILEEHPENFN HPTNGNKYFV LGRLLGKELA KGRENEPAFK PVANPNNGTV WFPETQHTLT NTPGPFLTFW RNYGGLSVFG YPLSEPFQEL NPDTGKVYWV QYFERNRFEY HPEEKPEFQV LLGLLGKQYY NEHKTEPKLA VKEWFFRYHT RAEAIPVDFI YGYNVQAFYQ ERDRLYQLVN NAAFGWIRQQ APWEDLQAAD GTIYWGELDK VINDAHAKGI KVLLSVVRSP EWASENGTHG LPSRRNFPKF GDFMQRMAQR YKGKVQAYEI WNEQNYAIEN GGVVAPAAYY VDMLEYAYKG VKAADPEAIV VSGSPTPTAT NRTDIAVDEL IYFAQMFAIP KFWNNVDVIG AHFGGTYNPP DTKWPDNPGP GPGWRDNSEF YWRRIEDVRQ VLVNSGNGDR QIWVTEIGWA TANTSPGFEY GNSNTIEEQG AYLERAMYMA RYDYAPWVGA MFVWNLNFAV TSPDPLHETA SFGVLNPDWS PRPAYTRLQQ FAATHK
|
| |