Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3253 |
Symbol | |
ID | 5735121 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 4115352 |
End bp | 4116611 |
Gene Length | 1260 bp |
Protein Length | 419 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 641280399 |
Product | hypothetical protein |
Protein accession | YP_001546018 |
Protein GI | 159899771 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.000295877 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCGCAG ACATTATTGC CAATCACTAC GAGGCTGTAG AACAAGCCGG AGTACGCTTC AAAAAATTGT ATGATTACCA AGTGCAGATG GAAACAATGA TTCGCCAGAT GTACGAGCAA CTTCGCAACG ATGGATGGCA AGGTACGGCT GCCCAAGCAT TCTTTGTTGA AATGACCGAT GTTGTGTTTC CTACCTTGCA ACGGTTGCAA AATACCTTGC TTGTGAGCAG TGAGTTGACC AAACAAATCA ATCAGATCTT CCGTCAAGCT GAAGAAGCGG CTGCAAAATG TATTACGTTT GATAGTGGTT CAATTGGTGG CATGCTGGCG GATGCTGGTC TTGCTGCTGG TCATGGGTTG GCGGGTGTGA GCATTGGTGC TGCGCTATTC GATGATACGT CAAAGCCGGA TTTACGCAAA ACAATCTTCA ACGACGATTA CATGAATCGT TTGGTTGGCT TAAAAATGCA AGGTATGGAT GTACCTGAAC TGAATAAAGC TATGGAAACC ATTATTAATG ACAAAGCTTC AGAAGCCGAT GTTCAAGCGG CTTTGGTCAA AATTGCTAAA TTGCGCGATG TGCCACTGGA GAAAATTCAG GCCGACCACA AGAGGTTTGT CGAATTACGG AAGCGAGCCA CCGAAAAAGG TGACGTTGAT CAACTTGATC AAGATCGACA TCCTGACTTT TTAGGCAGTA CCACAAGTTT ACGGTTTGGC AAAGTTCTTG GCGATGTCTT TGGTATCGAT CCAGTTTTTG GTTCATTGCT CAGTCCTACA GGCGGCTTAG TGGGTTACGG TAATATTGCG ATTGATGCTA AGGAACGTCC AGTTGGCTAT CACGGCATCA TGCACGACGC AGGTGGGTAT CTGCTCAAAC ATCATGAGAT GGGACCAGGC TATGATTACC TTGGCTTGGA AGGCCGTAAT CCCACTCACC CATTGACTGG TCAAGAGTCA GGCATTCGTT ATTGGAATCA GAAATTAGCC TATGGATTTA TTGATGGCGA TGGTATTAAT CTTAATAACC CCGTCCAAAA ATTGGCTGAA TACGCGGTTA TTGATGCTGT AGATCATAGA TTGGGTCAAA TTGTGGATGT TCATGAAGGG CTTAAAACCG TTAAAAATGC GGCGAACGAT GGCTGGAATG CCGCCAAAGA TGCGGCGAAC GATGGTTGGC ATGCAACCAA AGATGCGGCA AGTGATCGAT GGAATGCTGC TTCAAAATCA GCGAAAGATA TATTTTCTTT TCTATTCTAG
|
Protein sequence | MSADIIANHY EAVEQAGVRF KKLYDYQVQM ETMIRQMYEQ LRNDGWQGTA AQAFFVEMTD VVFPTLQRLQ NTLLVSSELT KQINQIFRQA EEAAAKCITF DSGSIGGMLA DAGLAAGHGL AGVSIGAALF DDTSKPDLRK TIFNDDYMNR LVGLKMQGMD VPELNKAMET IINDKASEAD VQAALVKIAK LRDVPLEKIQ ADHKRFVELR KRATEKGDVD QLDQDRHPDF LGSTTSLRFG KVLGDVFGID PVFGSLLSPT GGLVGYGNIA IDAKERPVGY HGIMHDAGGY LLKHHEMGPG YDYLGLEGRN PTHPLTGQES GIRYWNQKLA YGFIDGDGIN LNNPVQKLAE YAVIDAVDHR LGQIVDVHEG LKTVKNAAND GWNAAKDAAN DGWHATKDAA SDRWNAASKS AKDIFSFLF
|
| |