Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3251 |
Symbol | |
ID | 5735119 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 4112784 |
End bp | 4114154 |
Gene Length | 1371 bp |
Protein Length | 456 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641280397 |
Product | hypothetical protein |
Protein accession | YP_001546016 |
Protein GI | 159899769 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0229094 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAGCAG ATATCGTCCA GCATAAGTCT GATGATATGG AACAAGTTTC CAGTCGGTTC CAAAAATTGC ATGATCACCA AGTGCAAATG GAAGCGATGA TTCGCCGCAT GTATGAGGAA CTTCGAGGTG GTGCTTGGCA AGGGGATGCA GCCCAAGCAT TCTTTGCCGA AATGAACGAG CATATCTTTC CTGCTATGAA GCGATTTCAA AATGTGCTAG CAGTGAGCGG CGAAGTTACC AAGCAAATCT CGCAAATCTT CCGTCAAGCC GAAGAAGAAG CCGCCAAAGG CATTACCTTT GATGGCAGCG GCGCACCATC TGGTGGTGGC AATGGCACGG CTAGTGCTGG TGGCGGCGGC GGTGGCAACG CCGATGGTAG TGGCAGTGGT GGCAGTAATG CTGGTGCTAC TGGTGGTGGT GCAGGTATTG CTGCTGGCGG CGGCGGTTCA TCGTCAGGTG GCGGTGGCGG CGGTTCATCG TCAGGCGGCG GTGGTGGCGG CTCATCGTCA GGTGGCGGTG GTGGCGGTTC ATCGTCAGGC GGCGGTGGTG GCGGCGGTAG TAGCACTGCT GGCGCAGGCG GCGGCGGTGG CGGTGGCGGC GGTGGTGGCA GCAGCCAAGA ACCAATGTCA ACCGAGCAAG TCTTCAATGA TAAATACATG GGCGATTTGG TCGGCCAACA ATTCCAAGGT GCTGGCAACC CTGAGTTGAA CTCGGCCATG GAATTGCTGA CCAGTGGCAA TGCAACCCCT GAGCAAATTG AAGAAGCACT CAAGAAGATT GCCGCAGCTC GTGGCGTGCC ATTAGAAAAA ATTCAAGCCG ACTATGGCAA GTTCCTTGAA TTGCGCGAAC AAGCTGCCAA AACCGGCGCA GCCAACGGCC AATCGGCTGT TGAAGCAATC AACCAAACCT TCCATGGCGA TTTCATGGGC AGCACCTCAA GCTTGCGCTA TGGTAAAGTC GTCGGCGATG TCTTGGGCAT CGACCCAGTA TTTGGTTCAA TGCTCAATCC AAGCGGTGGC TTGGTTGGCC CTGGCAACAA AGCAATCGAC TTAGGCGATT CACCAGTCAG CTATCACGGT GCTGTCCACG ATGCTGCTGG CTACCTCTTC AACTACCACG ATATGGGCCC AGGCTATAAC TACCTTGGCT TGGAACGCCG CGACACGGCC AACCCATTGA CTGGCCAAGA ATCTGGTATT CGCTACTGGA ACGAAAAAAT GGGCAACACT GGCATCGGCG CAACCATTAG CAACGGCGCT GGGAGCTTGA TTGGTAAAGC TCAAGATGCT GTCAACTGGT TTGGCGATGT CAAGCAAGAT GTCCAAAATA CCTGGAGCGG AGTCAAAGAT TGGTTCTCCA AAACCTTCTA G
|
Protein sequence | MTADIVQHKS DDMEQVSSRF QKLHDHQVQM EAMIRRMYEE LRGGAWQGDA AQAFFAEMNE HIFPAMKRFQ NVLAVSGEVT KQISQIFRQA EEEAAKGITF DGSGAPSGGG NGTASAGGGG GGNADGSGSG GSNAGATGGG AGIAAGGGGS SSGGGGGGSS SGGGGGGSSS GGGGGGSSSG GGGGGGSSTA GAGGGGGGGG GGGSSQEPMS TEQVFNDKYM GDLVGQQFQG AGNPELNSAM ELLTSGNATP EQIEEALKKI AAARGVPLEK IQADYGKFLE LREQAAKTGA ANGQSAVEAI NQTFHGDFMG STSSLRYGKV VGDVLGIDPV FGSMLNPSGG LVGPGNKAID LGDSPVSYHG AVHDAAGYLF NYHDMGPGYN YLGLERRDTA NPLTGQESGI RYWNEKMGNT GIGATISNGA GSLIGKAQDA VNWFGDVKQD VQNTWSGVKD WFSKTF
|
| |