Gene Information |
Locus tag | Haur_2998 |
Symbol | |
ID | 5734870 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 3786122 |
End bp | 3787315 |
Gene Length | 1194 bp |
Protein Length | 397 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641280142 |
Product | hypothetical protein |
Protein accession | YP_001545764 |
Protein GI | 159899517 |
COG category | [R] General function prediction only |
COG ID | [COG1092] Predicted SAM-dependent methyltransferases |
TIGRFAM ID | |
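The coordinate and length fields above are related by simple arithmetic, which the following sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS length includes the stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
gene_len = gene_length_bp(3786122, 3787315)
print(gene_len, protein_length_aa(gene_len))  # 1194 397
```

Both results agree with the Gene Length (1194 bp) and Protein Length (397 aa) fields.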
| |
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGCCGCCC GTTCAATGCC AACAATCAAC TTGCCAACCC AGCTAAAGGC TCATCTTCAG GCCGGCCATC CATGGGTCTA CCGCGACCAT GTGCCGCCGA GTACTCGTTT AGCCAGCGGC ACATGGGTGC GGCTGCATTG TGGCAATTGG CAAGGGTTTG GCTTGTGGGA TGCTCGCTCG CCGATCGCTC TACGCATCTT TTCGAGCCGC ATGCAGCCCG ATGCCAACTG GATCAAAACG GTCGTTCAGC AAGCATGGCA GGCACGTGAA CCGTTGCGCC AAACTGCCAC CACCGCCTAT CGTTTGCTGT TTGGCGAGGG CGATGGCCTA CCAGGGATCA CCATCGATCT CTACAATCAA TATGCCGTGA TTGCGACCTA CGCCGATTGT GTTGAGGTAT TGATTGCCGA TGTGGTCAAG GCCTTGCAAG CCAGCGTGCC ACAACTGCGG GGCGTGGTGC GCCGCCGCCG CGACGATAGC GAAAACGACG ATGAAACTGG CAAAATTGAG TTGTTGTGGG GCGAATTGCC ACCAGCCCAA CTGATAGTCG AAGAACATGG CCTTAAATTG ATCGCTAATT TGTTTGAGGG CCAGAAAACT GGCTTATTCC TTGATCATCG TGAGAACCGC CATACCATTG AGCAATGGAG CCATGGTAAA ACGGTGCTGA ATTGCTTCTC GTATACTGGG GCATTTTCGT TATACGCTGC TCGTGGCGGC GCAACTGCCA CCACCAGCGT CGATATTGCG CCAGCCGCTG CCCACGATGC TGAACAAAAT TTTATGCTCA ATGGCTTGAT GAATGAACAC CAGCGCTTTT TGGCCCGCGA TTGCTTTGAT TTTCTGAGTC GCACGATTCA GCGTGGCGAA ACCTATGATT TGGTGATTCT TGACCCACCT TCGTTTGCCC GCTCGAAGAA AAATATTCAT GCAGCAACTC GAGCTTATGT CAAACTCAAT GCCTTAGCGA TTCAATGTGT GGCGAAGGGT GGGCTACTGG CCTCAGCCAG TTGTACTAGC CAACTTTCGC CCGCCAATTT TCGCTTGATG CTGGGCGAAG CTGCTGCCCA AACCGATCAG CAATTGCGCA TTATTCATGA GGCAGGGCAA GCGCTCGATC ACCCAGTGCC AGCGCATTTT ACCGAAGGCC GCTATCTCAA ATTTGTGTTA GCCCGCGTTG ATGAGCGTAT GTAA
Protein sequence | MAARSMPTIN LPTQLKAHLQ AGHPWVYRDH VPPSTRLASG TWVRLHCGNW QGFGLWDARS PIALRIFSSR MQPDANWIKT VVQQAWQARE PLRQTATTAY RLLFGEGDGL PGITIDLYNQ YAVIATYADC VEVLIADVVK ALQASVPQLR GVVRRRRDDS ENDDETGKIE LLWGELPPAQ LIVEEHGLKL IANLFEGQKT GLFLDHRENR HTIEQWSHGK TVLNCFSYTG AFSLYAARGG ATATTSVDIA PAAAHDAEQN FMLNGLMNEH QRFLARDCFD FLSRTIQRGE TYDLVILDPP SFARSKKNIH AATRAYVKLN ALAIQCVAKG GLLASASCTS QLSPANFRLM LGEAAAQTDQ QLRIIHEAGQ ALDHPVPAHF TEGRYLKFVL ARVDERM
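The gene sequence begins with GTG rather than ATG, yet the protein begins with M. Under translation table 11 an initiator codon such as GTG is read as methionine. The sketch below illustrates this with a hand-built codon table; it is an illustration, not the database's own pipeline. The function name `translate_cds` and the short list of start codons (ATG, GTG, TTG) are assumptions chosen to cover the common bacterial cases.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 uses the same
# amino-acid assignments and differs only in its permitted start codons.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Assumption: a small set of common bacterial initiator codons.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading an initiator first codon as M and stopping at a stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
print(translate_cds("GTGGCCGCCC GTTCAATGCC AACAATCAAC"))  # MAARSMPTIN
```

The output matches the first ten residues of the protein sequence listed above.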