Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_5050 |
Symbol | |
ID | 5737008 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009973 |
Strand | + |
Start bp | 66651 |
End bp | 67958 |
Gene Length | 1308 bp |
Protein Length | 435 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 641282215 |
Product | integrase catalytic region |
Protein accession | YP_001547806 |
Protein GI | 159901560 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTCACGG ATCGCTGGTT TGCTGCTCGC CGAACGTTGT ACGAATTGCT CCACACGAAC CCCGACTGGT CGAATCGCCA GTTTGCCACC GCGCTCAACG TCTCCCCCGA TTGGGTTCGT CTCTGGAAAC AGCGCATCGG TTCGCCACCC CATCCCGATC CTGATGTCGT CTGTCAGAGC CAATCACGGG CACGCAAAAC ACCACCACCA GCGTGGAGCG ATCGCGTCAT TCACCGGATT TTGACCTTGC GCCAGGAGTT GGCCGCGCAG TTCCATCGCA CGGTTGGAGC CAAAACCATT CTGGCCTATC TCCAACGCGA TCCTGATCTC GCCGATGACC GGATTCCCCG TTCACCCACT ACCGTGAATC GCATCTTGCG CGATCATCAG CTGCTCGTAG ACCCACCCAC GCATCAGCGC CAACCCCGCA CCCCTTGTCC GCCCATGCAG GAGATTGAAA TCGATTTTAC CGATGTCACG ACGATTCCGA CCAACCCCGA TGGCAAACGT CAGCACGCCG CCGAAGCCTT TATGTGGGTC GATGCGGGCA CATCCATCCG CGTTGCCGCG CGGATTAGCA CCGATTTTCA TATGGCCTCG GTGATCCGGA CGACCGCCAG TATTCTCCAG CAGATCGGGT TGCCTGCGCG GATTCGGATG GATTGCGATG TGCGCTTGGT CAGCAACAAG CGCGTCGCCG ATTTCCCATC GCCCTTCCAA CGCTTGTTGC TCAATCTCGG CATTCAGGTT GACGTGTGTC CACCCCATCG ACCCGACTTA AAGCCGTTCG TGGAGCGGTT TCATAAAAAC TACAAGGGCG AATCGGTCTA TCCAAACTGG CCGACGACCG AGGCCGAAGC CCAAGTCCAG GTCGATGCCT ATTGCGATTG GTATCGTACC GAGCGCCCGC ACCAAGGCCG GGCCTGTGGC AATCGCCCGC CTGCCGAGGC GTTTCCAGAA TTACCCGTGT TACCACCGGT TCCGGCGCAG GTCGATGCGG ATGGCTGGCT GAAGCAAATT GACGGCTGGA CGTTTGTTCG GCGGGTCAAT GCGCAAGGCA AGCTCATGCT GGATGGCGCA ACGTATACGG CGGGGATCGC CTATGCAGGG CAGGAATTGG CGGTGCAGGT GGATGCTGCC GCGCGGGAAT TGGTGCTGAT CCAGCGTGAA CGCGCGGTCA AGCGGGTCAC GTTGAAGCGG CTCTTGGGTG GGATGATGCC GTTTGAGCAG ATGGTTGAGG CATTGTGTGG CTTGGCTGCG CAGGAAACCA AACGGCTCAA CCAACGCCAG CAGCGCCGCC GCCGATGA
|
Protein sequence | MVTDRWFAAR RTLYELLHTN PDWSNRQFAT ALNVSPDWVR LWKQRIGSPP HPDPDVVCQS QSRARKTPPP AWSDRVIHRI LTLRQELAAQ FHRTVGAKTI LAYLQRDPDL ADDRIPRSPT TVNRILRDHQ LLVDPPTHQR QPRTPCPPMQ EIEIDFTDVT TIPTNPDGKR QHAAEAFMWV DAGTSIRVAA RISTDFHMAS VIRTTASILQ QIGLPARIRM DCDVRLVSNK RVADFPSPFQ RLLLNLGIQV DVCPPHRPDL KPFVERFHKN YKGESVYPNW PTTEAEAQVQ VDAYCDWYRT ERPHQGRACG NRPPAEAFPE LPVLPPVPAQ VDADGWLKQI DGWTFVRRVN AQGKLMLDGA TYTAGIAYAG QELAVQVDAA ARELVLIQRE RAVKRVTLKR LLGGMMPFEQ MVEALCGLAA QETKRLNQRQ QRRRR
|
| |