Gene Information |
Locus tag | Haur_0868 |
Symbol | |
ID | 5732769 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 986525 |
End bp | 988441 |
Gene Length | 1917 bp |
Protein Length | 638 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641278000 |
Product | ASPIC/UnbV domain-containing protein |
Protein accession | YP_001543644 |
Protein GI | 159897397 |
COG category | |
COG ID | |
TIGRFAM ID | |
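The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal check, not the database's own method: it assumes the start and end positions are 1-based and inclusive, and that the CDS length includes the stop codon. The function names `gene_length_bp` and `protein_length_aa` are hypothetical helpers introduced for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Number of residues encoded by a CDS whose length includes the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length = gene_length_bp(986525, 988441)
print(length)                     # 1917
print(protein_length_aa(length))  # 638
```

Both values match the Gene Length (1917 bp) and Protein Length (638 aa) fields of this record.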
| |
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACAGA CCTTGTTGAA ATACCAAACC CAGTTGCTGG CGGTCGCGGT GCTAGTTGGC ACGTTTATCG TGGCCCAACC ACCAACGCTT TCACAGGCTG AGCAAGCCGA TTTGAGCCAA AGTTTTGGCT TTGCGCAGCA GCCCCTTGCC GCATTGGCGG GCCATACCCA GCGCTTCATC CGCCCAGTTA ATCCAAGTTT GGCCCATATC GATGCTTGGA TCTCGTCGGT TGGGGCGGCG ATTGCCCTCA ACGATCTCGA TAACGATGGC TTATCCAACG ATATTTGTTA CGTCGATACC CGAATTGATC AAGTGGTAGT ACAACCAGCC CAAACCAGCA ACGCGCGTTA TCCAGCTTTT GCGCTTGATC CAAACAGCCT TAAATACGAT GCCAGCACAA TGGCTCCGAT GGGTTGTTTG CCCAGCGATA TCAACGAAGA TGGCATGCTC GATTTGATTG TGTATTACTG GGGGCGCACG CCAATTATCT TTGTCCAACA ATATCAGGGA GCAAATGTCG ATCTCAGCAG CCAAAGCTAT GTCGCCCAAG AGCTTGTAAC CACAGGCGAG CGCTGGTTTA CCAACACTGG CTTGGTCAGC GATTTCGACG GCGATGGCCA TCAGGATTTG CTGTTTGCCA ATTATTTTCC TGATGGTGCG GCGATTCTCG ATGCCAAATC CAGCCGCAAG CAAACCATGC AAGCCTCGAT GTCACGTGCA TTCAACGGCG GCGATAAGCA CTTTTTCCTT TGGCAGCAAA CCAGCACCGA ACAAGCGCCA TTTGTGGCTG TGCCCGATGT GCTGAGCGGC GAGTTGAACC ACGGCTGGAC GTTAGCACTA GGAACCTACG ATTTCAACAA TGATCTGTTG CCAGAACTCT ACATTGGCAA CGACTTCGGC CCCGATCGTT TTTTAATCAA CCGCTCAACG CCTGGCACGA TCAAATTAGA ACTGGCTGAG GGCAGCGGGG GCTTCACGAT TCCCACATCC AAAGTGATTG GACACGATTC GTTCAAGGGC ATGGGCGTTG ATTTCAGCGA TATTAATAGC GATCAGCACC CCGATATTTT TGTGAGTAAC ATCACCACAC CGTTTGGTTT GCACGAAAGC AATTTTGCCT ATGTCAGCGA TCCCAGCGCC AAACTCGATC AAAGTGAATT GCCCAATTAC ACCGACCAAA GCGAGCAACT AGGCTTTGGG CGCAGTGGCT GGGCTTGGGA TATTAAGCTA GCCGATTTCA ATAACGACCA GCGCGACGAA ATTTTGCAAG CAACCGGCTT TGTCAAAGGC ACAATCAATC GTTGGCCCGA ACTGCAAGAA TTGGCAACAG GCAACGATCA ATTGCTCGCC GACCCCGCCA GTTGGCCGCG CTTCAGTGCT GGCGATGATA TCGCGGGCCA TCAAATTAAC CCATTTTTTA GCCAAGCCGC CGATGGCCGC TACTACGACC TTGCCAAAAC CCTAGGCTTT GCGCCAACCG TCAGCCGTGG CATTGCAGTT GGCGATGTTG ATGGCGATGG TAAGCTTGAT TTTGCCTCAG CCAACCAATG GGAAGATTCG ATCTTCTACC ATAACACCAG CCAAAGCAGC AATCAAGCGC TTGGTTTACG CCTGCGGATC GCCAGCGATG GCAAGGCCAG CAATATTACA GGCTTTCAAC CAACCAGCGC GGCGATTCCG GCAATTGGAG CGCATGTCAC GGTCAAATTG CCCGATGGTC GCACGGTTAG CAGCCAAGTT GATGGCGGCA ATGGACATTC AGGCAAACGC AGCTACGATT TGCACTTTGG CTTAGGGGCA 
CTTGATCCAC AAACCCAACT TGAAGTAACG GTGCGCTGGC GTGGCCGCGA TGGCAGTGTT CAAACCAGCG TCGTCCAACT CACACCAGGC AACCATACCT TGATACTTGG CCAATAA
|
Protein sequence | MKQTLLKYQT QLLAVAVLVG TFIVAQPPTL SQAEQADLSQ SFGFAQQPLA ALAGHTQRFI RPVNPSLAHI DAWISSVGAA IALNDLDNDG LSNDICYVDT RIDQVVVQPA QTSNARYPAF ALDPNSLKYD ASTMAPMGCL PSDINEDGML DLIVYYWGRT PIIFVQQYQG ANVDLSSQSY VAQELVTTGE RWFTNTGLVS DFDGDGHQDL LFANYFPDGA AILDAKSSRK QTMQASMSRA FNGGDKHFFL WQQTSTEQAP FVAVPDVLSG ELNHGWTLAL GTYDFNNDLL PELYIGNDFG PDRFLINRST PGTIKLELAE GSGGFTIPTS KVIGHDSFKG MGVDFSDINS DQHPDIFVSN ITTPFGLHES NFAYVSDPSA KLDQSELPNY TDQSEQLGFG RSGWAWDIKL ADFNNDQRDE ILQATGFVKG TINRWPELQE LATGNDQLLA DPASWPRFSA GDDIAGHQIN PFFSQAADGR YYDLAKTLGF APTVSRGIAV GDVDGDGKLD FASANQWEDS IFYHNTSQSS NQALGLRLRI ASDGKASNIT GFQPTSAAIP AIGAHVTVKL PDGRTVSSQV DGGNGHSGKR SYDLHFGLGA LDPQTQLEVT VRWRGRDGSV QTSVVQLTPG NHTLILGQ
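The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It assumes the gene sequence is listed in coding orientation (5' to 3'), which is consistent with it beginning with ATG and ending with TAA even though the gene lies on the minus strand. Translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code and differs in which codons are permitted as starts, so a standard codon table is enough here because this gene starts with ATG. The names `CODON_TABLE` and `translate` are hypothetical and introduced only for this example.

```python
from itertools import product

# Standard codon table, built in the conventional TCAG ordering.
# Table 11 shares these amino-acid assignments; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence from this record.
print(translate("ATGAAACAGA CCTTGTTGAA ATACCAAACC"))  # MKQTLLKYQT
```

The output matches the first ten residues of the protein sequence above. Passing the full gene sequence to `translate` gives a way to compare the complete translation against the listed protein.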