Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0870 |
Symbol | |
ID | 5732771 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 990358 |
End bp | 992295 |
Gene Length | 1938 bp |
Protein Length | 645 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641278002 |
Product | ASPIC/UnbV domain-containing protein |
Protein accession | YP_001543646 |
Protein GI | 159897399 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTAGGA GCTTGTGGCA AAAATATCAC GGACGTTTGA TCGTTAGCCT TATCCTATTG ATCTGCTTTG GCTTAGCCCG AGAACCACAG TTATCGGCAG CAGAACGCAG CGAATTAGCC AAATCGTTTC AATTTACCCC CGCAATCCTG CCAACATTAA GTGGTTATCC CCAATCGACT ATTCGGACGG TCAATCCGAG CCTGGCCCAT ATTAGCGCTT GGATCTCATC GGTTGGAGCG GCAATCGCCA TCAACGATCT TGATGCTGAT GGGTTATCCA ATGATATTTG CTATGTTGAC CCACGCATTG ATCAAGTGAT TGTTACTCCC GTATCTCAGG CCAATTTGCG CTACCTGCCA TTTGCGCTTA ATCCCAGCCC ACGGCCCTAT AACCCAACCA CGATGGCCCC AATGGGATGT TTACCGGGTG ATTTCAACGA AGATGGTGTG CTTGATCTCT TGGTTTATTA CTGGGGACGC ACACCACTGC TGTTTTTCCA ACAACCAACC GATGGTTCGC TTACCGCTGA GCGATTCGTT GTCGAAGAAC TCATGAGCCA GTCCGAACGT TGGTATACCA GTGCAGGGTT ACTTGCCGAT TTCGATGGCG ATGGCCATCA AGATTTAATT CTAGGAAATT ATTTTCCCGA TGGAGCGCAG ATCTTAGATG AAAACTCGAG CAAGGCTGAG TCGATGCAGG CCTCAATGTC GCGAGCGTTC AACGGAGGAA ACAAACACTT CTTGCGCTGG ACTGCTCGTC CTGATCAACC ATTTGGGGTG CAGTTTATGC CGGTTGAGCA GGTGCTCGAA CGCGAACTTA ATCATGGCTG GACATTTGCA CTCGGTGCTG CTGACCTCAA CGGCGACCTG TTACCAGAAC TCTATATTGC TAACGATTTC GGCCCTGATC GGTTACTGCT TAATCAATCA ACGCCTGGCA AGCTAAAGTT TTCATTACTG GAAGGCCAAG CGGGGTTTAA TATCCCAAGC TCAAAAAGGG TCGGCCATGA CTCGTTCAAG GGCATGGGTG TTGAATTCAG CGATCTCAAC GGCGATGCTA TTCCAGATAT TCTTGTCAGC AACATTACGA CCAATTATGG ATTGCATGAG AGTAACTTTG CCTTTCTCAG CACTGGAGAT CAAACAGGTT TTACCCAAGG CCTTGCCCCT TACCGCGATC ATAGCGAGGC TTTGGGTTTA GCTCGCAGCG GCTGGGGATG GGATATTCGG GTGGCGGATT TTAATAACGA TACCAGTCTT GAGATAGTGC AGGCAACCGG ATTTGTTAAA GGAACGGTTA ATAATTGGCC TGAACTCCAA GAATTAGCAA CTGGTAATGA TGCCTTATTG GCTGATCCTG CTAGTTGGCC CAGCTTTCAA GCTGGCGATG ATATTGCTGG GCATCAAATT AATCCTTTTT TTGTTCGTGA TGCCACTGGC CGCTATCACA ATCTTGCCGC CGAGCTTAAT TTGGATACTC CTCATGTAAC CCGTGGCATT GCTACTGCCG ATGTTGATGG TGATGGACGC TTAGATTTTG CCTTAGCCAA TCAATGGGAA TCATCGTGGT TCTATCACAA TACAAGCCCG ATTAACACCA AAGCGTTGGG CCTTCGCCTA CGCCTACCGC TCGACGTTAA TCAGCCTTTG GCCGTGTTAT CAGGGTATCA GAGCGATCCG ACCCCAAGTT TGGCGGCAAT TGGCGCAAGC GCCACGATCA CCCTTCCCGA TGGTCGTACC TTGGTTGCCC AAGTTGATGG TGGTAATGGA CACTCCGGCA AACGCAGCTA TGATCTGCAT TTCGGTTTAG GTGAGCTGAA GGGTGATAGT CAGATTAATG TCGAAATTAC CTGGCGAACA AGGGATGGGG AGATTGTAAA AACGCGCCAA TTACTCAAAC CTGGAAATTA TACGATATTG CTCGGCTCGA CGAATTGA
|
Protein sequence | MIRSLWQKYH GRLIVSLILL ICFGLAREPQ LSAAERSELA KSFQFTPAIL PTLSGYPQST IRTVNPSLAH ISAWISSVGA AIAINDLDAD GLSNDICYVD PRIDQVIVTP VSQANLRYLP FALNPSPRPY NPTTMAPMGC LPGDFNEDGV LDLLVYYWGR TPLLFFQQPT DGSLTAERFV VEELMSQSER WYTSAGLLAD FDGDGHQDLI LGNYFPDGAQ ILDENSSKAE SMQASMSRAF NGGNKHFLRW TARPDQPFGV QFMPVEQVLE RELNHGWTFA LGAADLNGDL LPELYIANDF GPDRLLLNQS TPGKLKFSLL EGQAGFNIPS SKRVGHDSFK GMGVEFSDLN GDAIPDILVS NITTNYGLHE SNFAFLSTGD QTGFTQGLAP YRDHSEALGL ARSGWGWDIR VADFNNDTSL EIVQATGFVK GTVNNWPELQ ELATGNDALL ADPASWPSFQ AGDDIAGHQI NPFFVRDATG RYHNLAAELN LDTPHVTRGI ATADVDGDGR LDFALANQWE SSWFYHNTSP INTKALGLRL RLPLDVNQPL AVLSGYQSDP TPSLAAIGAS ATITLPDGRT LVAQVDGGNG HSGKRSYDLH FGLGELKGDS QINVEITWRT RDGEIVKTRQ LLKPGNYTIL LGSTN
|
| |