Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3768 |
Symbol | |
ID | 5735632 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 4736125 |
End bp | 4737702 |
Gene Length | 1578 bp |
Protein Length | 525 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641280920 |
Product | ankyrin |
Protein accession | YP_001546532 |
Protein GI | 159900285 |
COG category | [R] General function prediction only |
COG ID | [COG0666] FOG: Ankyrin repeat |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATAGTA TAACGATCGA TGAGCGGCTT TTTGAGGCAC TTCGTTCTGG GCAAAGCTTA ATTGCTACTG AGATTCTTGC TGCACACCCT GAGCTAGCAA CCACCCGCTT CGATCCCATC GATCAATCGC TGAGTGTTAC GTCTTTCGTA TGCCATCCAG AGCAATACGG CAATGCTCGG ATCGGTCAAT CCCCACTCCA TTTGGCAGCA TGGAACGGTG AACAGCGCTT GGTTAAGCAG CTACTTGAAC TTGGTGCTGA CCCCAATGCC CGCGATCGGC AGGGTGGCAC GCCGCTGCAT GCGATGGTAC GCTGGGTTAC CCGACCTGAT ATTGTCGGCA TGGTATTGGA ACGAGGCGCA GATATTAATG CCGTTGATTA TGCTGGGCAA ACACCATTAC ATTTAGCGGC TAGTTGCATT CGTCGCCCGG GTCATCAATG GGGCAATCAC ACCGACCTGT GCAACTTTTT GTTAGCACAT GGTGCAATTG CCGATATTTT CGCGGCAGTC ATGCTTAATT TAACCGATCA GGCGGCGATG CTGCTCAAGC AAAATCCTGA ACTAGTGCAC GCCCGCACAA CTGGCAATCA GACCCATCCA GAAAGCGCGA CACCATTGCA TATTGCCGTA GATCGTGGCA AGCAGGCCAT GGCGGAAATG CTGCTGGACT ATGGTGCTGA TCCCAATAGC CTCGATGCCC GTGGTCGCCC AGCCTTGTAT CTGGCAGCGC ATATAGCCGG AACGCGCAAA CTAGAGCCAA CCCCTGAACT GGTAGATCTG TTGTTACAAC ATAGTACAGC TACACCGATC TTCAATGCCA GCCTGATCGG CCAGTGTGCT GAACTTCGTG AGTTGCTTAT CCACGATCCT GCACAAATTC AGGCGCTTGA TCAAGCTGGA TATACTGCCC TGCATTTGGC GGCATGGAAT GGTCAAGTTG CAGCGGTTGC CGAATTATTG GCGCATGATG CCGATATTGC TGCCCGAACC AAACGCAACG AAACCGCCCT GCAACTCGCA ATAACCTATG GTCACCATGC AACTGCCGAA CTGCTGCTGA ACCATGGCGC AACTCCCGAT ATATTTAGTG CTGTTATCCT TGGTCGGATT GATCTGCTGG AACAATTGCT GGATCATCAA TCCGAACTCG CCAGTACCAC CAATCGCTAT GGACGCACGC CGTTGCGGCT GGCAATTGAA CGTGAGCAAA CAGCAGTTAT CGATTATTTG ATTGGTCGAG AAGTTAAACC CGACCTATGG ATGGCGGCAG GTATGGGCGA TTTTGCCAGG GTCGAAGCCT TAGTCGAAAC TGATCGTCAC GCTTTACATC AGCGCGATCA ATGGGGCTAT ACTGCGTTAC ATTGGGCCAG TAAATCTGGG CAACTTGCGG TGATCGAATA TCTGCTTGAG CAGGGTGCTG GCTTGGAGCC GCGCGGCTCT GATGGTGGCA CGCCGCTTAC CTTGGCCTTG TGGCATGAAC AATCGGCAGC AGCCCGCCTG TTGGTTGCTA GCGGCGCTGA TATTGATGCT CTAGACAATT GGGGTGGTTC ACCACGTAAT CAAGTAGCAA CGCTCTAG
|
Protein sequence | MNSITIDERL FEALRSGQSL IATEILAAHP ELATTRFDPI DQSLSVTSFV CHPEQYGNAR IGQSPLHLAA WNGEQRLVKQ LLELGADPNA RDRQGGTPLH AMVRWVTRPD IVGMVLERGA DINAVDYAGQ TPLHLAASCI RRPGHQWGNH TDLCNFLLAH GAIADIFAAV MLNLTDQAAM LLKQNPELVH ARTTGNQTHP ESATPLHIAV DRGKQAMAEM LLDYGADPNS LDARGRPALY LAAHIAGTRK LEPTPELVDL LLQHSTATPI FNASLIGQCA ELRELLIHDP AQIQALDQAG YTALHLAAWN GQVAAVAELL AHDADIAART KRNETALQLA ITYGHHATAE LLLNHGATPD IFSAVILGRI DLLEQLLDHQ SELASTTNRY GRTPLRLAIE REQTAVIDYL IGREVKPDLW MAAGMGDFAR VEALVETDRH ALHQRDQWGY TALHWASKSG QLAVIEYLLE QGAGLEPRGS DGGTPLTLAL WHEQSAAARL LVASGADIDA LDNWGGSPRN QVATL
|
| |