Gene Information |
Locus tag | Haur_1249 |
Symbol | |
ID | 5733127 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 1457794 |
End bp | 1459932 |
Gene Length | 2139 bp |
Protein Length | 712 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641278389 |
Product | PKD domain-containing protein |
Protein accession | YP_001544025 |
Protein GI | 159897778 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2133] Glucose/sorbosone dehydrogenases |
TIGRFAM ID | |
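The coordinate and length fields above are mutually consistent, which a short calculation can confirm. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions on the replicon and that Protein Length excludes the terminal stop codon. Neither convention is stated in the record, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return cds_length_bp // 3 - 1


# Values taken from the record for Haur_1249.
length = gene_length_bp(1457794, 1459932)
print(length)                      # 2139
print(protein_length_aa(length))   # 712
```

Under these assumptions the computed values match the reported Gene Length (2139 bp) and Protein Length (712 aa).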
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.8229 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGCGTT GGATTATTAT CGTTGTTCTG ATCTTTTCAG GTTTTTCTGC CCAATTTGTT CAACAATCGA ACCCTGAGCT TTCCCCTGCT GCTGTTCCTG CTGGCTTCAC CCAAACTGTG GTTGTACCCT CGAATAGCCT TGATGCTCCA ACTGCCTTTA CTTGGTTGCC CTCTGGTGAG ATGTTGATTA CTCAGCAAAA TGGTCAGTTG TTGGGTTGGA ATGGCACAAG CACGCGCACC GTAATGAGCC TTGGCAATCG AGTTTGTTAC GATTTTGAGC GTGGTTTGCT GGGCATCGCG GTTGATCCCC AATTTACCAG TGGTCGTCCA TATGTCTATG TCTACTATAC CTTTAATAAA TTTAACCAAA CTTCGAACAA TTGCCCTCGC CAAAGCCCCA GCACCAATCC GGTCAATCGA GTTTCACGCT TTACTTGGAG TAATAATGTG CTCGATATCA ACTCGGAATC GGTCTTGATC GACAATATTG GCTCATATAA CGGCAATCAC AATGCTGGCG ATCTTGGCTT TGGCAAAGAT GGCAAGTTGT ATATCAGCGT TGGCGATGGC GGTTGCGATT ATCTCGATAG TGGCTGTGGT GGCGCAAACG ATGCCTCGCG CGAACAGCAC ACCTTGCTTG GCAAAATTTT ACGCATCAAT GCTGATGGTA CGATTCCTAG CGATAATCCG TTTACTGGCA GCGGCACGGC CCGTTGTAAT ACTGGCTCAG TGGCTAGCGG CACGATTTGC CAAGAAACTT GGGCTTGGGG TTTTCGCAAT CCCTACCGTA TAACCTTCGA TCCCAATGCT AGCGGCGTGC GCCTGTTTGT CAACGATGTT GGCCAAAATG TGCGCGAAGA AATCGACGAA GTTGTGGCGG GCAAGGATTA TGGCTGGAAT TGTCGCGAAG GTACGCGGGT CAATAATTCA ACGGGGCCAT GTTCGCCAAC GCCCGCCAAT ATGGTTGACC CAATTTATGA ATATAGCCAT GGCAACGCTG GCGCACCATT TACCAACTGT AATTCGATCA CTGGTGGCGC GTTTGTGCCT GCCAATACTT TTCCTAGCAA TTACAGTGGT TATATGTTTG GCGATTATGT TTGCGGCAAG ATTTTTATGA TTTCAGCCCA AGCGCCCTAC AATTCGGTTC TAACTTTCTC AGATGATCCT GGATCAGTCA CGCATATGGC GTTTGGCCCG AATGGCGGTC GCCAAGCGTT ATTCTATGCG ACCTATGCTA ATGGTGGCGA GATTCGCCGA ATTAGCTATG ATGGTAGCAC CAGCTTGAAT TCTTCGTTTA CAGCCAACCC CAGTTTTGGC GCGGCTCCCT TGGCCGTAAC CTTTACCGCT AGCAATCCAA GCAGCGGCGC AAGCTATTTG TGGAATTTTG GCAATGGCAC GAGCCGCGAA ACTAGCACGG CCAGCACATC CTACACTTAC GCCAACAATG GCACCTACAC TGCAACCCTG TATTTGCGCG GCAGCAATGG CGATTTATCG AATGTGAGCC AGGCGATTGT GCGGGTTGGC GCAACTGCGC CGAACGCCAG CATCACCCAA CCAAACTCAA GTGCCCAATT TGCGGTTGGC CAAACAATCC AAGTGCGGGG GCAGGCCAGC GATGCCGAAC AAGGCCAATT GCCAGCCAGC GGTTTATCGT GGAAAGTAAT TTTGCATCAC GATACCCATA CCCACCCCTA TTTGACCCAA CCAACCACCA ATAGTTTTAG CTTTACTGCG CCAGCCCCCG AAGATTTATT GGCGGCCAGC AACAGCTATT TGGAGCTTGA GCTGACTGCG 
ACTGACGATA GTGGCTTGAG CCATGTTGTT ACCCAAACTA TCCAGCCCCA TAAAGTCAAT GTGACCTTGG CCTCAACCCC GAATGCTAAC GCCAACTTTG TGGTCAACAA CGACCCGATC GAAGCTGATG ATCCGTTTAT TTCGTGGGAA AACTACAGCC TACGCGTGAC TGCACCAGCC TATGCTGATA GCAATCGTTG GTGGCGTTTT GTGCGTTGGA GCGATAACAA CACCAGCAAT CCGCGCACCT TTACGACACC TGCCAGCGCT ACGACTTACA CCGCGGTCTA CGAAGAATTT ATCCCCTATC AGCTCTATTT GCCAGTTGTG CGCAAGTAA
|
Protein sequence | MSRWIIIVVL IFSGFSAQFV QQSNPELSPA AVPAGFTQTV VVPSNSLDAP TAFTWLPSGE MLITQQNGQL LGWNGTSTRT VMSLGNRVCY DFERGLLGIA VDPQFTSGRP YVYVYYTFNK FNQTSNNCPR QSPSTNPVNR VSRFTWSNNV LDINSESVLI DNIGSYNGNH NAGDLGFGKD GKLYISVGDG GCDYLDSGCG GANDASREQH TLLGKILRIN ADGTIPSDNP FTGSGTARCN TGSVASGTIC QETWAWGFRN PYRITFDPNA SGVRLFVNDV GQNVREEIDE VVAGKDYGWN CREGTRVNNS TGPCSPTPAN MVDPIYEYSH GNAGAPFTNC NSITGGAFVP ANTFPSNYSG YMFGDYVCGK IFMISAQAPY NSVLTFSDDP GSVTHMAFGP NGGRQALFYA TYANGGEIRR ISYDGSTSLN SSFTANPSFG AAPLAVTFTA SNPSSGASYL WNFGNGTSRE TSTASTSYTY ANNGTYTATL YLRGSNGDLS NVSQAIVRVG ATAPNASITQ PNSSAQFAVG QTIQVRGQAS DAEQGQLPAS GLSWKVILHH DTHTHPYLTQ PTTNSFSFTA PAPEDLLAAS NSYLELELTA TDDSGLSHVV TQTIQPHKVN VTLASTPNAN ANFVVNNDPI EADDPFISWE NYSLRVTAPA YADSNRWWRF VRWSDNNTSN PRTFTTPASA TTYTAVYEEF IPYQLYLPVV RK
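The sequences above are printed in blocks of ten characters, so the whitespace has to be removed before any analysis. The following is a minimal sketch, not the database's own tooling. It strips the blocks, computes a GC fraction, and translates a coding sequence with the amino acid assignments of translation table 11, which are the same as in the standard genetic code. All helper names are hypothetical. The sketch does not handle ambiguous bases, and it does not convert alternative start codons such as GTG or TTG to methionine, which does not matter here because the gene begins with ATG.

```python
from itertools import product

# Codon order follows the usual T, C, A, G layout of genetic code tables.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks of the ten-character block layout."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate_cds(seq: str) -> str:
    """Translate an unspliced CDS codon by codon and drop one trailing stop."""
    seq = clean(seq)
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    protein = "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))
    return protein[:-1] if protein.endswith("*") else protein


# First three blocks and last nine bases of the gene sequence in this record.
print(translate_cds("ATGTCGCGTT GGATTATTAT CGTTGTTCTG"))  # MSRWIIIVVL
print(translate_cds("CGCAAGTAA"))                         # RK
```

The two printed fragments agree with the start (MSRWIIIVVL) and end (RK) of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` would give a value to compare against the reported GC content of 50%.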