Gene Information |
Locus tag | Haur_5252 |
Symbol | |
ID | 5737210 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009974 |
Strand | - |
Start bp | 22374 |
End bp | 24107 |
Gene Length | 1734 bp |
Protein Length | 577 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 641282416 |
Product | integrase catalytic region |
Protein accession | YP_001548007 |
Protein GI | 159901762 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
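The coordinate and length fields in the table above agree with one another, and a short check makes the arithmetic explicit. This is a minimal sketch that assumes the start and end coordinates are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon. The function names are illustrative and do not belong to any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive 1-based start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(22374, 24107)
print(length)                     # 1734
print(protein_length_aa(length))  # 577
```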
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.56302 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCAAAC GTGCCGCTGC GCCTGCGTGG CAACAACTCC AAATGCAGAT TACCTGCCCC AAACAGTATC TCTATGAACT CATTCGCCCC ATTGTCTTAA CCGAAGAGTC GGTGGCCGAC CGCGCTAGCG AAACCGCCGT GCCCGATCAC ATCATCACCT ATCATCTTGA CCATTTTCGC ACCAATGGCT TACCTGGCTT GGTTGCCGCT CCCACGCCCC TCCAACGCGC TGCCCGCTTT CCCGAAGAAT TGGTCGCCTT TGTGCTGGCG CTCAAAGCGG AGCATCCGCC GCTCACCGCC CATGAACTGG CTACGATTTG TTTTATCAAA CACGGTCGGC GGCCCAGTAT CAAAACCATT CGGCGCGTGC TCGCCCATCA CCCGCTCCCG CAGCTCACCA CGCGGCGGTT CCCTCCGTTC CATGACAATC CTGATCCCAT CCAACGCCGC CACGCCATCC TGGTCTTAAG CCTTGAGGGC TGGACGAAAA AACGGATTGC GACCTATCTT CAGATTAGCC GTTCCACCGT GTACAACACC TTTGCACGCT GGCACAAGGA GGGCTTCGCA GGGTTACAAG CAAAGTCCCG TGCGCCGCGC CGTCGCCATC CCAAAGTCAC CATCGCTATC CAGCAGCGGG TTCGCCGCTT ACAGCGCAAC CACTTGTTGG GTGCGTGGCG CATGCATGCC GCTCTGCGCC GTGAGGGCAT CCGCCTCAGT CCGCGCACCT GTGGGCGCAT TATGGCGGTC AATCGCGACC TCTGCCCCGA ATTGCCCAAA CGGCAGCGAT CACGGAAGCA TGAACCGCGT GCCATGCCCT TTGCTGCCCA GTACCGACAT CAGTACTGGA CGATTGACAT CCGCTATCTC GATATGCACC GCTTGGGTGG CGGCCATATT TACTGTATCT CGATTGTCGA AAATTACAGC CGCGCCATTC TTTCCAGCGC GATCAGTCGG ATTCAGGACA CCACTGCCGT GTTGAAAGTG CTCTATGATG CGGTGGCAAA GTACGGCTGT CCTGATGGGA TTGTGAGCGA TTCAGGCAGT GTGTTCCGCT CGCATCGGCT CCAGGAGGTG TGTCAGCACC TGCGGATTCA GCAGTGCCCG ATTGAGAAGC GCCAACCCTG GCAATCCTAC ATCGAGACGA CCTTCGGTAT CCAACGGCGT ATGGCTGACG AAGCAGAGGA GGGGTTTCGG GCCGCGCAGA GTTGGGATGC GCTCTGGCAC GCGCATCGCA CATGGCTACT CCACTACAAC ACCGAAGTGC ATTGGGCACA TCGGCAGCGC CAAGATGGGC GAGAAACGCC AGCCGAAGTC CTGACGTGGA TTCGCGGTCG CCCCTATCCG GAACGGCTGC TCCAGCGCAT TTTTGCCGCA ACACGGGTGA AACGGCGTTT GGATCGGGTG GGCTTTCTGC GGGTGCGCCG CTGGCGGATC TACAGTGAAA TTGGCTTGGC CAAGGAAGCC GTTGAGGTAT GGCTGGAGGC GCAGCATGTC ACGATTACCT ATGCGGATCA CCACCTGCGG TCGTATCCGG CGACGATTGA TACCGATGGG TGGATACGGA CGATGGAGAC GGGCACGCGC TACGAGCATC CCTTTGGGAG TCGGCAGTTA ATGCTGTGGG ATGTGATTGA AGACGACTGG GGCAAGACCC AGTATGTCGG CAGTACGCCG CGCCGAAAGT CCACCATCCC CCTCGCAGAG CAGTTGCGCT TCGCTCTTGG GTAA
|
Protein sequence | MPKRAAAPAW QQLQMQITCP KQYLYELIRP IVLTEESVAD RASETAVPDH IITYHLDHFR TNGLPGLVAA PTPLQRAARF PEELVAFVLA LKAEHPPLTA HELATICFIK HGRRPSIKTI RRVLAHHPLP QLTTRRFPPF HDNPDPIQRR HAILVLSLEG WTKKRIATYL QISRSTVYNT FARWHKEGFA GLQAKSRAPR RRHPKVTIAI QQRVRRLQRN HLLGAWRMHA ALRREGIRLS PRTCGRIMAV NRDLCPELPK RQRSRKHEPR AMPFAAQYRH QYWTIDIRYL DMHRLGGGHI YCISIVENYS RAILSSAISR IQDTTAVLKV LYDAVAKYGC PDGIVSDSGS VFRSHRLQEV CQHLRIQQCP IEKRQPWQSY IETTFGIQRR MADEAEEGFR AAQSWDALWH AHRTWLLHYN TEVHWAHRQR QDGRETPAEV LTWIRGRPYP ERLLQRIFAA TRVKRRLDRV GFLRVRRWRI YSEIGLAKEA VEVWLEAQHV TITYADHHLR SYPATIDTDG WIRTMETGTR YEHPFGSRQL MLWDVIEDDW GKTQYVGSTP RRKSTIPLAE QLRFALG
|
| |
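The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any length or composition check. The sketch below shows one way to do that and to compute a GC percentage. The helper names are hypothetical, and the example uses only the first two blocks of the gene sequence rather than the full record.

```python
def clean_sequence(blocked: str) -> str:
    """Join a block-formatted sequence by dropping all whitespace."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


example = "ATGCCCAAAC GTGCCGCTGC"  # first two blocks of the gene sequence
seq = clean_sequence(example)
print(len(seq))                   # 20
print(round(gc_percent(seq), 1))  # 65.0
```

Applying the same two functions to the full gene sequence gives values that can be compared with the Gene Length and GC content fields.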