Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0578 |
Symbol | |
ID | 5732299 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 664744 |
End bp | 665934 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641277705 |
Product | hypothetical protein |
Protein accession | YP_001543354 |
Protein GI | 159897107 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1769] Uncharacterized protein predicted to be involved in DNA repair (RAMP superfamily) |
TIGRFAM ID | [TIGR01888] CRISPR-associated protein, Cmr3 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.191176 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACACTTG ATCTCTGGCT AATTCAGCCA TTCAACTCCT TGGTAATTGC CGATGGTCGC CCATTTGAAA ACCAGCCAGG TGCCCGCGCC AAAAGCCTCA GTTTCCCACC ACCGTCGGTC ACGGTTGGCG GCTTTCGGGC ACGGGCTGGA GTGAAAGCCA ACTACAATTT TTCCGCCAAA TTAAGCGATC CTGAATATAC CAATCTCTTA GCGATGCAGG TACGTGGCCC GATTCTGCTC AAAGAAACCG TTGATTTGTT GATTCCATTT GTTGCAATCC CTGCCGATGC CTTGATTCGT CAGGATGGGG CAAATGTTAC CTGTTATCGC TTACAACCAC TCGACCCACG CTATTTCCCG ACTGAGCAAA CTGATCTTGT GCTTGGGCAG CAGCCTGATC AACCAAGGTT TAGCTTAGTT GGGAGTCAAC AGCACACCAA AGGCAAAGCC CCCGCAAACC TACCACGCTT TTGGAATTGG GAAAGGCTGT TTTTACCATG GTTAATCAAT CCACCGATCC AGCCATTCGA TCTTGAGCAG CAGCGAGCTG CTGGCCAAAT GCTCGACTTA GAGGGCGATA CCCGAACGCA TGTAGGGATT AAGCCAAGCA CCCAAATCGC TGAGGATGGC CAATTATTCC AAACCCAAGG CCTGAGTTTT CAATCGCTTG AGGCTGGCTA TGGCCTAGGT TTATGGAGCA GCACGCCAAT CGATCCGGCA GTTGCCAGTT TAGGTGCTGA ACGTAGGTTA GTTGAATGGT ATCGAGTTCC AAGTGGTACT GCCGCTGGCA TTGAAACAAT TGAGCCAAGC ATTCTAGCCG CGATTGTCCA AACCAAACAT TGTCGGATTA TCCTGCTCAC ACCAGCCATT TGGGATGCAG GCTTTTATCC CAAACATTTA CCAACTTTTG GCTTGCCGAT GACAGCCACT ATTAAAGCCG TGGTAAACCC TCGGCCCGAA TATGTTTCAG GTTGGGATTT ACGCCTCCAA CGCCCAAAAG CAACCCGCCG CCTTGCTGCC GCCGGCACTG TGCTGTTTGT TGAGCTTACT GGCAGCGATG CTGAGATTGA GCAATGGGCA CGCCAACTCT GGTTTAGCAA CATCAGTAGC AGCGAGCAAG CTGCCAAAGA TGGCTTTGGA TTGGCAGTGT TGGGCATATG GAATGATGGC TATTTACAGG AGCTAGCATG A
|
Protein sequence | MTLDLWLIQP FNSLVIADGR PFENQPGARA KSLSFPPPSV TVGGFRARAG VKANYNFSAK LSDPEYTNLL AMQVRGPILL KETVDLLIPF VAIPADALIR QDGANVTCYR LQPLDPRYFP TEQTDLVLGQ QPDQPRFSLV GSQQHTKGKA PANLPRFWNW ERLFLPWLIN PPIQPFDLEQ QRAAGQMLDL EGDTRTHVGI KPSTQIAEDG QLFQTQGLSF QSLEAGYGLG LWSSTPIDPA VASLGAERRL VEWYRVPSGT AAGIETIEPS ILAAIVQTKH CRIILLTPAI WDAGFYPKHL PTFGLPMTAT IKAVVNPRPE YVSGWDLRLQ RPKATRRLAA AGTVLFVELT GSDAEIEQWA RQLWFSNISS SEQAAKDGFG LAVLGIWNDG YLQELA
|
| |