Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0100 |
Symbol | |
ID | 5731993 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 129796 |
End bp | 131118 |
Gene Length | 1323 bp |
Protein Length | 440 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641277222 |
Product | type III restriction protein res subunit |
Protein accession | YP_001542880 |
Protein GI | 159896633 |
COG category | [K] Transcription [L] Replication, recombination and repair |
COG ID | [COG1061] DNA or RNA helicases of superfamily II |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.000273539 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCCGAC CTGCCACCCA AGCCGAGCAG CCAGAGCAAG AACAATGGCT GCTCGACGAG GCAAGTTTGC TCGATGCAAG CCTGCTTGAT GAACCCGACG AGGATTTGCC AGAGGTTGAT GCCCGTTTAC AAGGCGGCTT CAAAGCCAAA TTACAATTCG CCCCGCGTCC CTATCAAACC GAGGCCGTCG CTGCTTGGAC GGCCAACGAA GGACACGGGG TGATTGTGCT GCCCACTGGA GCAGGCAAAA CGATCACAGC GATGTTGGCA ATTGCCAAGC TTGGGCTGCG CACGCTGATT GTCGTGCCAA CGATTGAATT ACTCTATCAA TGGCGCGACA CCGTGGTCCA AACCCTCGCG CTCGATCCAA AATTAGTCGG CGTGGTTGGT GATGGCCAGC GTGAATGGCG ACCAATCACC GTGATTACCT ATGCTTCGGC GGCCATGCCC GATGCACCAC TTGAAAATTT GGGCTTGCTC ATTTGTGATG AAGTGCATCA TTTGCCTTCA CCAGCCTATA GCACGATTGC CCTGCGCAGC CGCACGCCCT ATCGCCTAGG CTTAACCGCC ACACCCGAAC GCAGCGATGG CTCGCATACC GCGCTTGATC GCTTGGTTGG CAAAGTGGTT TATCAGCGTG CGCCTGCCGA TTTAGCCGAA GAAGGCCATA TCGCCAAGTT TCGCGAAAAA CGGATTTTGG TCGATCTAAC TGCCGATGAG TTGGTGCGCT ACGAAACCTT GATGACCACA TGGCGCTGGT TTTTGGCCAA ACATCGGCAT AAATTGGCCA GCGGCGGCGA TTTCTTCGGC GAACTCATTC GCCGTTCTGG TAGCGACCCC CAAGCTCGCA ATGCGCTCCA AGCTCAGCAT CAAGCGCGAA TGATTGCGCT CAACGCCGAG AAAAAGCTTG AGCATGTTGG CCAACTGCTC AGCCAACATC CCAACGATAA AATCATCATC TTTTCGGAAT ACAACGCCTT GGTCAATACG ATCAGCCGTC AGTTTGCCAT ACCCAGCATT ACCTATCGCA CAGCTGCTGA TGAACGTAAA TCAATTTTGG ATGGCTTTCG GTCTGGACGC TATTCCAAGT TGGTGACGGG GCGGGTGCTG AACGAGGGCG TTGATGTTCC CGATGCCAAT GTCGCGATTG TGGTCAGTGG TTCAGCCACG GCCCGCGAAT ATATTCAGCG TTTGGGGCGG GTGCTGCGCA AAAAGCCCGA TGAAGCCTTG CTTTACGAGT TGGTCACGCG CAATACCAGC GAAGTTCAAA CCGCCCGCCG TCGTAAAAAA GCCGTGGCCG CCCATAATCT GCAGGCGAAG TAG
|
Protein sequence | MTRPATQAEQ PEQEQWLLDE ASLLDASLLD EPDEDLPEVD ARLQGGFKAK LQFAPRPYQT EAVAAWTANE GHGVIVLPTG AGKTITAMLA IAKLGLRTLI VVPTIELLYQ WRDTVVQTLA LDPKLVGVVG DGQREWRPIT VITYASAAMP DAPLENLGLL ICDEVHHLPS PAYSTIALRS RTPYRLGLTA TPERSDGSHT ALDRLVGKVV YQRAPADLAE EGHIAKFREK RILVDLTADE LVRYETLMTT WRWFLAKHRH KLASGGDFFG ELIRRSGSDP QARNALQAQH QARMIALNAE KKLEHVGQLL SQHPNDKIII FSEYNALVNT ISRQFAIPSI TYRTAADERK SILDGFRSGR YSKLVTGRVL NEGVDVPDAN VAIVVSGSAT AREYIQRLGR VLRKKPDEAL LYELVTRNTS EVQTARRRKK AVAAHNLQAK
|
| |