Gene Information |
Locus tag | Haur_3554 |
Symbol | |
ID | 5735413 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 4470437 |
End bp | 4471396 |
Gene Length | 960 bp |
Protein Length | 319 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641280701 |
Product | endonuclease/exonuclease/phosphatase |
Protein accession | YP_001546318 |
Protein GI | 159900071 |
COG category | [S] Function unknown |
COG ID | [COG3021] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
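The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon. The function names are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(4470437, 4471396)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

With the values listed for Haur_3554, this reproduces the 960 bp gene length and the 319 aa protein length.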
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCAAAC GTGGCTTGAT CTATTTGGCG TGGCTGGGCG TTGTGCCATG CAGTTTGTGG TCGCTTTTAC GCTGGACTGC GCTGGCAACT ACACCGCAAG GTATGGTATT GCTGGTGTTT GATGCCTGGG TGTATGCCAG TTTATTGCCG ATTGCGCTGC TGGCGGTTAG TTTGCGTCGC AATTATTTGC TGGGCGTGCT CGGCTTGAAT ATGCTGGTAG TTTTGGGCTT ATATGGCGGG CGTTGGCTAC CCCAATCAGC GAATTCGACC CCCCAAGATT TGCGGATTAT GACCTGGAAT GTGTTTTACA ATACCCAAGA TATTGCTGGC TTGGCGGCGA CAATTCGTCA GCAACAGCCG GATATTGTAG TTTTGCAGGA ATATAATTTT CAGCTTGACC CGCAACTCCC CGAAGCCTTG GAAGATTTGT ACCCTTATGC CGCGCTTGAT CCCCATTCGG GCGCTGGTGG TTTGGCGACG CTTAGCCGCT GGCCGCTGCG CGAGTTGGCT CCGGTTGCGC GTGGTGTTGA TAGTTGTGGC TGCCAATATT TGGAGATTGC CACTCCAAAC GGCCCAACCC GCCTGATCAA CACCCATCCG CATATTCCGC TGGCCAGTTT CAAGGGCATT TATACCAAAA CCCAGCAAGA CCCAACCTTC GATCATTTGC TGAAATTGAT TGCTGACCAA AGCCAACCCT TAATTTTGGC AGGCGATTTG AATACGACTG AACGCCAGCC CAATTATTTA CGTTTACGTC AACAGCTTGG CGATGCCTAC CAACAACAAG GCTGGGGTTT GGGCTATACC TTTCCGAGCA ATGGCGCTAT TCCCAAAGCG GTGCGTTTGG ATTATATTAT GCCCAACGCC CATTGGCAGC CCTTGCGGGC TTGGAATGGC ACGGCTAATT TGTCGGATCA CGGCTTTGTG GTCGCCGATT TGCAGCGTTT AGCCGAATAG
Protein sequence | MIKRGLIYLA WLGVVPCSLW SLLRWTALAT TPQGMVLLVF DAWVYASLLP IALLAVSLRR NYLLGVLGLN MLVVLGLYGG RWLPQSANST PQDLRIMTWN VFYNTQDIAG LAATIRQQQP DIVVLQEYNF QLDPQLPEAL EDLYPYAALD PHSGAGGLAT LSRWPLRELA PVARGVDSCG CQYLEIATPN GPTRLINTHP HIPLASFKGI YTKTQQDPTF DHLLKLIADQ SQPLILAGDL NTTERQPNYL RLRQQLGDAY QQQGWGLGYT FPSNGAIPKA VRLDYIMPNA HWQPLRAWNG TANLSDHGFV VADLQRLAE
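The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the gene sequence is given in coding orientation (it begins with ATG even though the gene lies on the minus strand) and uses the standard codon-to-amino-acid assignments, which translation table 11 shares for elongation codons. The helper name `translate` is hypothetical, and only the first ten codons of the record are used here.

```python
from itertools import product

# Codon table built in TCAG order; amino-acid assignments are those of the
# standard genetic code, which translation table 11 also uses.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    # Remove the spaces used to group the sequence into blocks of ten.
    seq = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence listed in the record.
prefix = "ATGATCAAAC GTGGCTTGAT CTATTTGGCG"
print(translate(prefix))  # MIKRGLIYLA
```

The output matches the first ten residues of the listed protein sequence. Passing the full gene sequence to the same function would be the way to check the remaining residues.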