Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0676 |
Symbol | |
ID | 5732576 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 774329 |
End bp | 775465 |
Gene Length | 1137 bp |
Protein Length | 378 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 641277805 |
Product | P4 alpha zinc-binding domain-containing protein |
Protein accession | YP_001543452 |
Protein GI | 159897205 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0358] DNA primase (bacterial type) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0255686 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTGAGC ATCTTTCGGG CGATGATTTT CTTTCCTTGG TTGGCCAGCG CACCGCCCTG CGCAAAGTGG CCGCAACCAA TGGCGGTGAA TGGGCTGGAG CCTGCCCCTT GTGTGGTGGG AAGGATCGCT TTCGAGTCCA ACCGCATGGG GATCGTGGTG GCCGCTGGTG GTGTCGCAGT TGTCGCGATG GCCAGCCGTG GGCTTCGGAT ATTGATTTTG TATTCCTGCG CGATCAGGGC TTTATCCCCA GCAACGATCA TCCAGAATGG CCAGAATTGA TCAAAGGGGC TTTTGCCACA CTGGGGTTGA CGATCGAGCA ATCACGGCCA TCGACGAGAG TCGTCCCTGA CCCTGAGCCA CCGACTGCCT GTGAGCCACC CGCAGCGCTC TGGCAACAAG CTGCCCTGAG TTTTCTTGGC TACGCGGTTG ATATGTTGTG GAAGGATGGG GAGAGCCAGC CCTCGCCACG CACCTACTTA CACCAACGCG GCCTAACCGA TCACACGCTC CGCTCTGCCG CGATTGGCTA TAACCCCAAG CCGCTCTACC GTCCGCTGCA CAAGTGGGGA CTGACAGACC CAACGAAAGA CGGCGTGTGG TTGCCTGCGG GCTACGTCAT CCCATGGTTC TGTGATGGAC AGCTCTGGAA GCTCTCGATT CGCCAAACCA CCCCACGCGA CGAGGGAATG AAATACGTGA CGATTGCTGG CAGCAGCAAC GTGCCCTATG GCATTGATAC CTTCCGCGCA GGTCGCCCCG GCATTCTGGT GGAAGGCCCG ATTGATGCGT TGGTCGCCCA ACAGGCGCTC GGCCAATTCG AACGCCATGG GCAGCCACTC AGCGGGGTCG CGGCGATTGG CACAACCCAA GGGCGCAGCT TTCGCTGGCT CGTCCAATAC AGCCTCTGTC AACCACTCTT CATCGCCACC GATGCCGATC AGGCAGGCGA TAAGGCCGCT GCCTATTGGC TCGATATCTT CAAAGGCAAA GAGGTTGCCC GCATGCGGCC TGACCCGCAC GATTTTGGCA CGATGGTCGT GCAGTGTGGT ATGGATTTAC GGGCGTGGAT TGCCACCCAT CTTGATCGAT GGTGGACGAG CGACCTCGCC CCTAGCCCGA TTCAATCGGT TGCCTAA
|
Protein sequence | MIEHLSGDDF LSLVGQRTAL RKVAATNGGE WAGACPLCGG KDRFRVQPHG DRGGRWWCRS CRDGQPWASD IDFVFLRDQG FIPSNDHPEW PELIKGAFAT LGLTIEQSRP STRVVPDPEP PTACEPPAAL WQQAALSFLG YAVDMLWKDG ESQPSPRTYL HQRGLTDHTL RSAAIGYNPK PLYRPLHKWG LTDPTKDGVW LPAGYVIPWF CDGQLWKLSI RQTTPRDEGM KYVTIAGSSN VPYGIDTFRA GRPGILVEGP IDALVAQQAL GQFERHGQPL SGVAAIGTTQ GRSFRWLVQY SLCQPLFIAT DADQAGDKAA AYWLDIFKGK EVARMRPDPH DFGTMVVQCG MDLRAWIATH LDRWWTSDLA PSPIQSVA
|
| |