Gene Information |
Locus tag | Haur_3585 |
Symbol | |
ID | 5735446 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 4508505 |
End bp | 4510814 |
Gene Length | 2310 bp |
Protein Length | 769 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641280734 |
Product | hypothetical protein |
Protein accession | YP_001546349 |
Protein GI | 159900102 |
COG category | |
COG ID | |
TIGRFAM ID | |
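The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the terminal stop codon (the gene sequence listed below ends in TAG). The helper names `span_length` and `expected_protein_length` are illustrative only and are not part of any database API.

```python
# Values copied from the Gene Information table above.
start_bp = 4508505
end_bp = 4510814
gene_length_bp = 2310
protein_length_aa = 769


def span_length(start: int, end: int) -> int:
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks compare a derived value with the value reported in the table.
coords_ok = span_length(start_bp, end_bp) == gene_length_bp
protein_ok = expected_protein_length(gene_length_bp) == protein_length_aa
print(coords_ok, protein_ok)
```

Under these assumptions, both comparisons hold for this record: the span covers 2310 bp, which corresponds to 770 codons, or 769 residues plus one stop codon.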
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.717748 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTTCGTT TAGTGCGCGT TTTTCTGCTT TTTTCCGTGG CGCTGGGTCT TTTTGCCAGC TTGCCAATGG TTTCACAGGC TGCTCGCACC AGCCTTAATC CTATTTCAGC AACAACTGCG GCAGCCCCGC TTAACCAAGC AACGGTTACG CCGGTAGTTA CCGATAGCGA TACTGTGACC TTTGCCCAGC TTGGCGCTCG CGAAGATTCG ATGGAAGGGC CATTCAATGC CCGTTATATC AACATTCGCT TGCCAAATAC CTGGAAATTA GAGCCAGGCG CAGCGATTAC GCTCGATTTT GAAACCTTCG TTTCAGGCAG CGCCGTCAGC AGCGAAGAAA ACGTGCAGTT TTTTGCTGGC TCGATGGATG TGCAATTTAA CAATGTGCAG CTTGAACGAA TTTTCTTAGA TCGTTCAGGC CCGCGCCAAG TGACCATCCC CATTTCAGCC ACCGCCTATA CCTCAACCCG CATCGATGGT GCTCACACTC TGGGCATTTT GCTCGATGCA GGCTTAAATT GTGGTAACGA TAGCCAAACC AGCGTGATCA TTCGAGCAAC CTCATCGCTT AAATTGCCAC ATACCTTGGG TACGCTATCA ACCGATGTTA AGCAACTGCC CCGCCCAATC TTCCAAGATT CACCGCTGGA GCCAGATGTT GCCACGATTA TTGTGCCCAA CAACGCCAGC GCTGCCGAGC TTGAAGCAGG CTTGATTGTG GCCGCGAAAG TTGGCAGCAT TACCTCAAGC CGGATTCAAG TGCCGTTAAT CGCCGAAAGT GTGCTTTCAC CAACCTTGCG CACCGAATCG CACTTGATTT TCGTGGGCCA AGCTGCCAAT TTCGAAAATT TGGATGCTGT GGATTTTGCT GAGGGCAGTG GAGCCGAGGG CTTTAATCTG AGCGAAGCTC AAGCCGCCGA TGGCATTATT CAAATGGCAA CCTCGCCTTG GAACCAACAA CGGGTGGTTT TAGCCCTCAG CGGCAACGAT GACGAGGGCG TGATCAAAGC TGCCAAGGCA TTTAGCACCG GCAACATTCG TGCTAGCAGC GAGCCTTCGA TTGCTGTAGT CGCTGAAGTT AATTCGCCCG ATGTCTTGAC CAAAACCTTG AATTTGCAAA TTACCGATGT GCCAAGCACT GGCTTGACGG TCGAACGCAG CTTCCGTAGC CTTGGCTTCG AAACGGTTTC AGAGTTTGGA ATTGGTGGGC ATACCTTTGA ACTGAATGTT GATATGCCCA GCGGCTATGA AATGAACGAC GATGGCTATA TCGAAGTCCA TTTTGCCCAC TCGGCACTGC TCGATTACTC AGTTTCAGCG ATGATGGTGC GCGTCAACGA TCGCCCAGCA GGCTCGATTC GCTTCGACGA TACCACCGCC CAAAATGGGG TCGGTCGCGT GCCGATTGCA CGGGGCAACT TGAATGTTGG TCGCAACCGA ATCACGATTA GCGCCAACTT GATTCCCAAT ACGCCTTGTG TTGATCCCAA CCTTGCGGGG ATTTGGTACA CGCTACGAGC TGATTCGATG ATTGGGATTC CAGTGCGGGC CACGCGTGGG CGCTCGGTGG TGCTGCGCAA TTTAGCAACC TTCCTTGAGC CATTGACGAT TAGCAATACT TTGAAAAATT TGGCCTTTGT TGTTCCAAGC GATGATCCAG CCAGTTGGAC GGTCGCAGGC CAACTCGCCA CGGCCTTGGG CGATAGCCTT GACCCAGCTT TTGTCGAATT ACGCGCGGTC AATCCGGAGC AAGTTGCTAG CGTCCAAGCA AATCATGATT TGATTTTGGT TGGTCAAGCA 
CCTGATCATG CTATTTTGCA AGATCCAGCG TTGCAGCCAA TTATTCCTGC CCCGTTTGCC CAAGGCAGCA AGCATCCAAC CCTTGGCAAT AGTCGGGTGG TCTATCGAAT TCCAGCTGAA TTGAGCCTTG GTTACTTAGA ATTTATGCGT TCGCCATGGA ACGAGCAGCG CAGCATTTTG GCGGTGCTTG GCAGCACCGA TCAAGGCGTG CAATGGTCGG GCAATGCCTT GACGACTTCG CGCCTGCGCA CGCGGCTCAA CGGCAGCTTG GCAATCGTCA ACGACCAACA GATCAGCGTT GAAGATCCGA CGGTTACTGG CGATACTGCG GGGATTGCCA ACGATGCCCT CGGCGAAAAC CAAGATGACC CGATCTATCG TGAAGTTATA CCACCTACAA AGCCCGAATG GATTTTGTAT GCCATCGCGG GTCTGTTGGT GGTGATGGTG ATCATTTCAG TGATTGTCAT CATTCAAGCA GCCCGCAAGC GCAAGCAACG CCGAATTTAG
|
Protein sequence | MVRLVRVFLL FSVALGLFAS LPMVSQAART SLNPISATTA AAPLNQATVT PVVTDSDTVT FAQLGAREDS MEGPFNARYI NIRLPNTWKL EPGAAITLDF ETFVSGSAVS SEENVQFFAG SMDVQFNNVQ LERIFLDRSG PRQVTIPISA TAYTSTRIDG AHTLGILLDA GLNCGNDSQT SVIIRATSSL KLPHTLGTLS TDVKQLPRPI FQDSPLEPDV ATIIVPNNAS AAELEAGLIV AAKVGSITSS RIQVPLIAES VLSPTLRTES HLIFVGQAAN FENLDAVDFA EGSGAEGFNL SEAQAADGII QMATSPWNQQ RVVLALSGND DEGVIKAAKA FSTGNIRASS EPSIAVVAEV NSPDVLTKTL NLQITDVPST GLTVERSFRS LGFETVSEFG IGGHTFELNV DMPSGYEMND DGYIEVHFAH SALLDYSVSA MMVRVNDRPA GSIRFDDTTA QNGVGRVPIA RGNLNVGRNR ITISANLIPN TPCVDPNLAG IWYTLRADSM IGIPVRATRG RSVVLRNLAT FLEPLTISNT LKNLAFVVPS DDPASWTVAG QLATALGDSL DPAFVELRAV NPEQVASVQA NHDLILVGQA PDHAILQDPA LQPIIPAPFA QGSKHPTLGN SRVVYRIPAE LSLGYLEFMR SPWNEQRSIL AVLGSTDQGV QWSGNALTTS RLRTRLNGSL AIVNDQQISV EDPTVTGDTA GIANDALGEN QDDPIYREVI PPTKPEWILY AIAGLLVVMV IISVIVIIQA ARKRKQRRI