Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0312 |
Symbol | |
ID | 5732207 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 370945 |
End bp | 372111 |
Gene Length | 1167 bp |
Protein Length | 388 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641277436 |
Product | hypothetical protein |
Protein accession | YP_001543092 |
Protein GI | 159896845 |
COG category | [R] General function prediction only |
COG ID | [COG4106] Trans-aconitate methyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0938935 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAACTAA CACCAGAAAA TTATCATGAG CAGTTGATCG CGTTGGCGTT AACTGACGAT TTTGTGCGCC TGACCATGAG TGGTGCTGCC CGTGCCACCG ATCTACGTTG GCAACGGGTG GTGGTGCGGC CTGTGCAATT GAAACAAGGC CGCGCTTGGC AAGCAGCCTA TTTTGACCAG CGTCAAAATA TCACCAAAAA TTACGCAATC GAGCAAGCCA GCAGCGCCCT CGGTGAGATT ATTGCGATTC CGCTGAGCAA TATTACGCTG GAAACCACCA GCGAACGCAT CCAAATCCAA CGGAGCAAAA AGGGCAAAGT GATTATTAGT CGAGTCCGCA ATCAAGCGGC GGCTCCAGAT TTGCGCCATA ACCACGTTAA AGCCTTGCCT TTGCCCAGCG ATAGCCCTGA TGCCTATTTA CAAAAAACGG GCATTATGAC CAACGATGGG GTGATTCGTG CCAGCATGAG CAAAAAATAC ACCCAAATCA ATGAATTTTT GCGGGTGTTC GATGAGCTTG ATCTCAAACC CAGCCCTGAG CAACCGTTGC GAATTCTTGA TGCTGGCTGT GGCTCGGCCT ATTTGACCTT TGCGGCCTAT CACTATTTGG TCAACATTCG TGGCTTAGCG GCGGTGGTGA TTGGGGTTGA TTCAAACGAA TATTTAATTG CTAAATGTCG CGCTCAAGCA GAAGAATTGG GCTACACCGA TATGCAATTT ATCGCCATGC CCTTAGCCGA TTGGCAGCCT GAGCAGCAGC CAGATGTGGT GTTTTCGTTG CATGCCTGCG ATACCGCCAC CGACGATGCC TTGGCTTTGG CGATTCGCAG CCAAGCCCAA GCAATTTTGA GTGTGCCTTG TTGCCATAAA CATCTGACGC ATCAAATTCA AGCCGAGGTG CTCAACTCCA TGTTGCGCCA TGGCAGCATT CGCCAGCGCA CCGCCGATTT AGTGACCGAC AGCCTGCGGG CGCAACTGTT GCGGATCAAT GGCTATCGTA GCGAGATTAT TGAGTTTGTT GATGCAGAGC AGACTGGCAA GAATTTAATG ATTCGGGCTA TTCGTAGCAA AAAACCTGAT TCCAAAGCCG TGGCCGAATA TCAGGCACTT AAGCAATTTT GGGGTGTTAC GCCCTATTTA GAGCAGTTGT TGGGATCAGG CAACTGA
|
Protein sequence | MQLTPENYHE QLIALALTDD FVRLTMSGAA RATDLRWQRV VVRPVQLKQG RAWQAAYFDQ RQNITKNYAI EQASSALGEI IAIPLSNITL ETTSERIQIQ RSKKGKVIIS RVRNQAAAPD LRHNHVKALP LPSDSPDAYL QKTGIMTNDG VIRASMSKKY TQINEFLRVF DELDLKPSPE QPLRILDAGC GSAYLTFAAY HYLVNIRGLA AVVIGVDSNE YLIAKCRAQA EELGYTDMQF IAMPLADWQP EQQPDVVFSL HACDTATDDA LALAIRSQAQ AILSVPCCHK HLTHQIQAEV LNSMLRHGSI RQRTADLVTD SLRAQLLRIN GYRSEIIEFV DAEQTGKNLM IRAIRSKKPD SKAVAEYQAL KQFWGVTPYL EQLLGSGN
|
| |