Gene Information |
Locus tag | Haur_4304 |
Symbol | |
ID | 5736163 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 5495457 |
End bp | 5496521 |
Gene Length | 1065 bp |
Protein Length | 354 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641281464 |
Product | THUMP domain-containing protein |
Protein accession | YP_001547064 |
Protein GI | 159900817 |
COG category | [R] General function prediction only |
COG ID | [COG2933] Predicted SAM-dependent methyltransferase |
TIGRFAM ID | |
|
|
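The coordinate and length fields above are linked by simple arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates (consistent with the values listed) and a CDS that ends in a single stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Haur_4304.
length = gene_length_bp(5495457, 5496521)
print(length)                     # 1065
print(protein_length_aa(length))  # 354
```

Both results agree with the Gene Length and Protein Length fields of the record.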
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCTTGATA CCTATCGTGT TTTAATTAGT GCTGCGCCCG AGATGCTTGA GGCAGCGATA GATGAAATTC GTTATGGCAA ATATGGGGTT GCCGTTCAAC GGCTCGCGCC AGAAGTGGCC TTGGTTGAGT TAGAGCAACC ACTCGCTGAA TGGCACGCCT TCTTGCTCAA ACGGCCAGCC ATTTTTGTGC GCCATATCGC GCCAGTTCAT CGTGAAGTCG TGCTCGAAGG TACGCCTGAT GATTTAGAAC GCCTAACCCA AGTGGCCTTG AATCTTGCGC CACATTTGCC AGCCCGTAAA CCATTTGCGG TGCAAACTCG CTTGCTTGAT GAACATGTCA AATTAACCCG CTTTGAAATT AACCAAGCCT TGGCCGAAGC AGTTTCGCAA GTCAGCGGGC TACCCTTGGA TGTACGCAAT CCGCAGGGCG TGCTTTCGGT TGCTCTGAGC ATGGATCGGG CTTATTTGGG TGTTTCGACC GTGGAATTTA ATCGCTCGGC TTGGGCGGGT GGCGCACACC GCTTTGCCCG CGAACAAGGC CAATTAAGCC GCGCTGAATT TAAATTGCTC GAAGCCCAAG CCTTATTCAA CATCGAATGG CCCGACCATG GCCAAGCCTT GGATTTAGGG GCTGCGCCTG GTGGTTGGAC GCGCATTTTG CGGCGAGCAG GCCTGCCAGT AGTCGCGATC GACCCAGCCG ATTTGCACCC CAGCTTGCGC TACGATCAAG GCGTGCGCCA TTTGCAAATC TTGGCTCAGC GCTTTTTGCC TACCCAAGAG CGTTTTACCG TGATCGTTAA TGATATGCGG ATCGATGCGC GTGATTCGGC GTGGCTGATG GTTGATTCAG CCAATGCCTT GACCAACGAT GGTTTAGCGG TGGTGACACT CAAATTGCCG CATGAACATA GTGCCCAAAT CGCCTATCAC TCGCTGAGTA TTTTGCGCAC GGCCTATTGG GTGATTGGGG CACGTCAACT GTTCCATAAT CGCTCGGAAG TGACTGCTGT GCTGCGTAAA AAACATGAAA AACGCCCCAA ACCTGAGCAA TTGCCAAAAG ATTAA
|
Protein sequence | MLDTYRVLIS AAPEMLEAAI DEIRYGKYGV AVQRLAPEVA LVELEQPLAE WHAFLLKRPA IFVRHIAPVH REVVLEGTPD DLERLTQVAL NLAPHLPARK PFAVQTRLLD EHVKLTRFEI NQALAEAVSQ VSGLPLDVRN PQGVLSVALS MDRAYLGVST VEFNRSAWAG GAHRFAREQG QLSRAEFKLL EAQALFNIEW PDHGQALDLG AAPGGWTRIL RRAGLPVVAI DPADLHPSLR YDQGVRHLQI LAQRFLPTQE RFTVIVNDMR IDARDSAWLM VDSANALTND GLAVVTLKLP HEHSAQIAYH SLSILRTAYW VIGARQLFHN RSEVTAVLRK KHEKRPKPEQ LPKD
|
| |
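The sequences above are displayed in space-separated blocks of ten characters. A minimal sketch of how such a string might be normalized and sanity-checked before further use is given here. The helper names are hypothetical, and `COMMON_STARTS` is an assumption that lists only the most common bacterial start codons rather than every initiator permitted by translation table 11.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}    # stop codons in translation table 11
COMMON_STARTS = {"ATG", "GTG", "TTG"}  # assumption: common bacterial starts only


def normalize(seq: str) -> str:
    """Remove the block-separating whitespace and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the normalized sequence."""
    seq = normalize(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def check_cds(seq: str) -> dict:
    """Basic structural checks on a coding sequence."""
    seq = normalize(seq)
    return {
        "length_bp": len(seq),
        "multiple_of_three": len(seq) % 3 == 0,
        "start_codon": seq[:3],
        "start_ok": seq[:3] in COMMON_STARTS,
        "stop_codon": seq[-3:],
        "stop_ok": seq[-3:] in STOP_CODONS,
    }


# Toy input in the same block layout, not the full gene sequence.
toy = "GTGCTTGATT AA"
print(check_cds(toy))
print(round(gc_fraction(toy), 3))
```

Passing the full Gene sequence field to `check_cds` and `gc_fraction` would allow a comparison with the Gene Length and GC content fields. The gene begins with GTG while the protein begins with M, which is consistent with GTG acting as an initiator codon under translation table 11.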