Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4361 |
Symbol | |
ID | 5736221 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 5570330 |
End bp | 5572483 |
Gene Length | 2154 bp |
Protein Length | 717 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641281522 |
Product | kelch repeat-containing protein |
Protein accession | YP_001547121 |
Protein GI | 159900874 |
COG category | [R] General function prediction only |
COG ID | [COG3889] Predicted solute binding protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.423968 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTACGC GTACCCTTCG GATTGTGCTC CTGCTGGGGT TGGTGCTGCT CTTTTTTGGT GGCTCGTATA CCCAAGCGCG TGAGCTACAA CAGCCCATGC CACGCTCGCA AACCGTGCTT GTACCAACAA CCTACATCAA CGAAAGCTTC GATAGCATTA GTTTTCCGCC GCTCAATTGG TCAACAACAA TTATTACCTC GACCGACACG CCTGATCCTG AATGGACGTA TGTGACGAGT GCAACCCGAC CGACGGCCCA GCCTCATACG GGCGCGGGTA TGGCGCATTT CAATAGCTAC TCCACAATCA ATGGCAATGC AGCGCGATTA TCAGTCGTGC TTACCCCCAC AACCAGCGTG TTGCGGGTAA GTTTTTGGTA TTACCATAGC GCGATTTTTC CAACTTCTGC CGATACGTTG CTGTTGCAAA CCAGCAGCGA TAACCAAAAC TACATAACTC GTGCTAGCTA TCCACGCTAT CGAGCAACCG ATGGTTGGAC GCAGTATCAG CTTGATTTGC CATTTGCTAG CGCCGGCCAG CCATTTTACC TTGGATTTTT AGGCATCAGC GATTTTGGGG CCAATCTTTT GCTCGATGAT GTGCTGATTC AAGATACGCC GCCGATTGAA ATTTTTGGCA CAACCAGCAA TCAAGGTTGT GCTGGCGATA CCTTGCTCTA TCCTTTAAGT GTCCGCAATA ATTACCCTAA TGCGCAAACG CTCGACCTGA ATTTGGCTGT AAGTGCCTGG CCCAGCAGTT TGCCGTTTAA TCAGCTTGCT ATTCCTGCCC AAAGCAGCCG CCCAATCACG GTTAATGTGC AGATTCCAGC AACCGCCCAG CCGCATACCA GCGATCAAAC GACCTTGCAA TTGAGCAATG GCTTAGTTGA ACTGAATCAA GCAATTGTGA CGAATTGTGC CCTTGGGCAG TGGATTGATC GCGAGGATTC GCTGGTGGCT GCGCGTTATT CCTCGGTTGT GAGTGCTGAT GGAGCGCTCT TCCAAATTGG CGGTCAAGGG CCAAATAATA ATTCGCCTGC CTTGGCGAAC ACTCTGCGCT ACCAGCCAAT CACGGGGAGT TGGCAACAAC GGGCGGCCAT GCTTACGCCA GTATTTGGTG CTGATGCCGC TACCCTCAAC GGCGAAATTT ATGTCGCTGG GGGCTATACC ACTGGTGGCT CGACCACGAC AGGGTTGATT AGCAGTTTAC AAATTTATTC GCCAACGCTC GATACTTGGC GCAGCGGCCC AAGTTTGCCA ATCGCATTGG CCTATTATCA ATCGGCAGTT GTTAATGGCA AACTGTATAT TATTGGTGGC TCGAATGGCA GCAATGCCTT AACCAGCGTC TGGATTTTCG ATCCTATTGC TCAAGTATGG AATGCTGGCT CAGCGCTGAT GAGGGCTCGG GCATTTGCTT CGGCTGGCGT GATAGGCAAT AAAATCTATG TTGCCGGCGG CACAGCTACA ATCAGCAATC AAACTGCCAT GGATACCATG GAAATTTTTG ATCCAAATCT TGGATTTTGG ATGCCTGCGC CCAACTTGCC ACGCCGTCAA ATGCAAGGTG GCGATGCTCA AATTCTTGAC CGCTTCTTCG TCATTACCAC GGGCTATTCA ATGCCAGTTG TCGCCTCGAA CTCAAGCCTG ATCTTTGATC AACAGACTAA TCAATGGTCA GAAGTATTGT TGAATAGCTC GCGTTATGGA GCCGAGGCCG ATAGCATCAA CGACACGGTG TTTGTGGTTG GCGGTCGTCA GTTTGCTAAC AATGTCTTTA CTATGAGCAG CCGCAACGAA TCATTCCAAA TCTGTCGATT TGTACTCACC ACCGCCACGC CAACGCCAAC GGCTACCGCC ACATCGACCG CAACGGCTAC CAACACGCCA ACCAACACCG CTACCGCTAC ACCGACCGCA ACGGCCACCA ACACGCCAAC CAATACAGCG ACGAATACGC CAACCAATAC CCCAACCGTT ACGTTAACGC CGACCAACAC ATCAACAGCG ACGGTTACCA ACACGCCGAC CAATACGCCA ACAGTAACGG CGACGGGTTC ACCAACCCAT ACTCCAACCA ATACGCCAAC CGCGACCCAA ACGCTTGACG TGCCCGATCT CTTCTTGCCG CTGGTTGGGG TGGAATTGCG CTGA
|
Protein sequence | MTTRTLRIVL LLGLVLLFFG GSYTQARELQ QPMPRSQTVL VPTTYINESF DSISFPPLNW STTIITSTDT PDPEWTYVTS ATRPTAQPHT GAGMAHFNSY STINGNAARL SVVLTPTTSV LRVSFWYYHS AIFPTSADTL LLQTSSDNQN YITRASYPRY RATDGWTQYQ LDLPFASAGQ PFYLGFLGIS DFGANLLLDD VLIQDTPPIE IFGTTSNQGC AGDTLLYPLS VRNNYPNAQT LDLNLAVSAW PSSLPFNQLA IPAQSSRPIT VNVQIPATAQ PHTSDQTTLQ LSNGLVELNQ AIVTNCALGQ WIDREDSLVA ARYSSVVSAD GALFQIGGQG PNNNSPALAN TLRYQPITGS WQQRAAMLTP VFGADAATLN GEIYVAGGYT TGGSTTTGLI SSLQIYSPTL DTWRSGPSLP IALAYYQSAV VNGKLYIIGG SNGSNALTSV WIFDPIAQVW NAGSALMRAR AFASAGVIGN KIYVAGGTAT ISNQTAMDTM EIFDPNLGFW MPAPNLPRRQ MQGGDAQILD RFFVITTGYS MPVVASNSSL IFDQQTNQWS EVLLNSSRYG AEADSINDTV FVVGGRQFAN NVFTMSSRNE SFQICRFVLT TATPTPTATA TSTATATNTP TNTATATPTA TATNTPTNTA TNTPTNTPTV TLTPTNTSTA TVTNTPTNTP TVTATGSPTH TPTNTPTATQ TLDVPDLFLP LVGVELR
|
| |