Gene Information |
Locus tag | Haur_0996 |
Symbol | |
ID | 5732899 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 1139486 |
End bp | 1140466 |
Gene Length | 981 bp |
Protein Length | 326 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641278130 |
Product | hypothetical protein |
Protein accession | YP_001543772 |
Protein GI | 159897525 |
COG category | [S] Function unknown |
COG ID | [COG4301] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR03438] probable methyltransferase |
|
|
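The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 1139486
END_BP = 1140466
GENE_LENGTH_BP = 981
PROTEIN_LENGTH_AA = 326


def span_length(start: int, end: int) -> int:
    """Length of a coordinate span, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one terminal stop codon (assumes an unspliced CDS)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions, both checks agree with the record: the span covers 981 bp, and 981 / 3 = 327 codons, one of which is the stop codon. Because the strand is "-", the coordinates refer to the forward strand of NC_009972, and the gene itself reads from the reverse complement of that span.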
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGAAA CTCAACAAGT AGCCAAAGTA CGCTTGCTTG ATGCCGCCCC CACCATAGCC TGCTTCCGCA GCGAAGTGCT TGAGGGCTTG CGTCAACCAA TCAAAACCTT GCCCTGTAAA TTTTTCTACG ATGCTGAAGG CTCGCAGATT TTTGATGCCA TTTGTGAATT AGCCGAATAT TACCCAACCC GTACTGAGCT GGCAATTTTG CAGCAAGCCA TGCCTGCAAT TCGCCACTGG GTTGGGCCTG ATGTCCGGTT AGTTGAATAT GGCAGCGGAG CCAGCCGCAA AACCCGTTTG CTGCTCGATC AACTTGAAGC ACCAGCGGCC TATCTGCCAA TTGATATTTC ACGCGAACAT TTGCTCGCCG CCAGCCAAGA TTTAGCCGAG CGCTATCCAG CAATCGAAAT TTTGCCAATA TGTGCTGATT ACACTCAGCC CTTGAGTTTG CCCCAGGCTC AACGCTCAGT TGGTCGCACA GTGGTGTTTT ACCCAGGCTC GACAATTGGC AATTTTCACC CTGACGAAGC GTTGAGCTTT TTAACAATGA TGCGGGAGCT GTGCTTGCCT GATGGTGGCG TGCTGCTTGG CGTTGATCTC AAAAAAGATC CGAGCCTGTT GCATGCAGCC TACAACGATA CAGCCGGAGT GACTGCAGCA TTTAATCTCA ATCTGCTGGC GCGAATTAAT CGTGAGCTTG ATGCCAATTT CGCGCTAGCC AACTTTCGCC ATTATGCCTG CTACAACCCA ATTCAAGGCC GAATCGAAAT GCACTTAGTC AGTTTGCATG ATCAAACGGT GTGGATTGGC GATCAGCAAA TTGACTTTCG CCGTGGCGAG CCAATTTGGA CTGAATGCTC CTACAAATAT CATCTGCAAG AATTTGCCGC TTTGGCAGCC CAAGCCCAAC TTGCGGTAGC CGAGGTTTGG ACAGACCCCC AAAACCTATT CAGCGTGCAC TATTTACGCC CGATTGCTTA A
|
Protein sequence | MSETQQVAKV RLLDAAPTIA CFRSEVLEGL RQPIKTLPCK FFYDAEGSQI FDAICELAEY YPTRTELAIL QQAMPAIRHW VGPDVRLVEY GSGASRKTRL LLDQLEAPAA YLPIDISREH LLAASQDLAE RYPAIEILPI CADYTQPLSL PQAQRSVGRT VVFYPGSTIG NFHPDEALSF LTMMRELCLP DGGVLLGVDL KKDPSLLHAA YNDTAGVTAA FNLNLLARIN RELDANFALA NFRHYACYNP IQGRIEMHLV SLHDQTVWIG DQQIDFRRGE PIWTECSYKY HLQEFAALAA QAQLAVAEVW TDPQNLFSVH YLRPIA
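The relationship between the gene sequence and the protein sequence above can be sketched in a few lines of Python. This is a minimal illustration, not the database's own pipeline. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which start codons are permitted), and it uses only the first 30 and last 21 nucleotides of the gene sequence, copied from the record. The helper names are hypothetical.

```python
from itertools import product

# Standard codon assignments, in TCAG order; translation table 11 uses
# the same codon-to-amino-acid mapping.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Fragments copied from the Gene sequence field (10-nt blocks as displayed).
FIRST_30_NT = "ATGAGCGAAA CTCAACAAGT AGCCAAAGTA"
LAST_21_NT = "TATTTACGCC CGATTGCTTA A"


def clean(seq: str) -> str:
    """Remove the display spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 0, stopping at the first stop codon."""
    nt = clean(seq)
    protein = []
    for i in range(0, len(nt) - len(nt) % 3, 3):
        aa = CODON_TABLE[nt[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    nt = clean(seq)
    return 100.0 * sum(base in "GC" for base in nt) / len(nt)


if __name__ == "__main__":
    print(translate(FIRST_30_NT))  # MSETQQVAKV
    print(translate(LAST_21_NT))   # YLRPIA
```

The two fragments translate to the first ten and last six residues of the protein sequence listed above. The last 21 nucleotides are in frame because the full gene length (981 bp) is a multiple of three. Passing the complete Gene sequence field to `translate` and `gc_percent` would allow the same comparison against the Protein sequence and GC content fields.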
|
| |