Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3299 |
Symbol | |
ID | 5735169 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 4163946 |
End bp | 4164974 |
Gene Length | 1029 bp |
Protein Length | 342 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641280446 |
Product | SCP-like extracellular |
Protein accession | YP_001546063 |
Protein GI | 159899816 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0000136792 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGGTGG CATCTACTTG GCGGCGCAGC CGCTGGATCA GTTTGGTTGT AACAAGTTTG TTGGTATTTG GCTTGGTGGC CCCACTTGCA GCCCAAGAAC CCCAAGCTAT CGCTGGCGGC AGCATGAATT ATCTGCCTGC TGTGATGTCA AACCCTGAAC TTGATATCAA TAACCGCCAA TCGGTGGTCA ATTTCTATAC CAATTTCTAC TTGCCAAGCG AAGGCGTAGC CGATGGCTGG ACTGGCAGCC AAGCCAGTTG TAACGCTGGC ACAACTAGCC AAGCCTATCG CGATGCAATT CTGTTGCGAG TTAATTATTT TCGTAAAATG GCGGGCATGG CCGCAGTCAC CCTCAACGAG AGCTACAATA GCCAAGCCCA ACAAGCGGCC TTGATGATGA GCCGCAACAA TAGCCTCAGC CATAGCCCAC CAACGTCATG GGCTTGCTAC ACCGCCGCAG GTAAAACTGC CGCAGGTAAA TCGAATCTCT ATTTGGGTCA AATTGGGCCA GGTGCAATTA CGGGCTATAT GCTTGATCCT GGGGCTGGCA ATAGTGCCGC TGGTCACCGC CGCTGGATTC TATACCCGCA AAATAAATCT ATTGGGACTG GCGATGTGCC CAGCCGTAGC GGTTATGCTG GCTCAAATGC ACTCTACGTG ATCACCAGTG ATTTTGGCAC GGCCAGACCT GCCACCCGCA CCGAATATGT GCCATGGCCG CCAGCAGGTT TCGTGCCCTA TCAAGTGGTG TTTGGGCGTT GGTCGTTTGC CTTGCATAAC GGCAATTTTA GCAATGCAAC CATCACAATG AGCCAAAATG GCACGAATAT TCCCGTCACC AAAGAAACGG TTGCCACAGG CTTTGGCGAA AATACGATCG TTTGGATTCC CCAAGGCTAC AACCATAGCT CAAGTTGGGC GCAGCCAAGC GCCGATACCA CCTACACAGT CACGATCAGC AATGTTGTGG TTAATTCGCA AAATCGCTCC TTCACCTATA ATGTAACCGT GATTAATCCA AATCAATAG
|
Protein sequence | MQVASTWRRS RWISLVVTSL LVFGLVAPLA AQEPQAIAGG SMNYLPAVMS NPELDINNRQ SVVNFYTNFY LPSEGVADGW TGSQASCNAG TTSQAYRDAI LLRVNYFRKM AGMAAVTLNE SYNSQAQQAA LMMSRNNSLS HSPPTSWACY TAAGKTAAGK SNLYLGQIGP GAITGYMLDP GAGNSAAGHR RWILYPQNKS IGTGDVPSRS GYAGSNALYV ITSDFGTARP ATRTEYVPWP PAGFVPYQVV FGRWSFALHN GNFSNATITM SQNGTNIPVT KETVATGFGE NTIVWIPQGY NHSSSWAQPS ADTTYTVTIS NVVVNSQNRS FTYNVTVINP NQ
|
| |