Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1938 |
Symbol | |
ID | 5733827 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 2350188 |
End bp | 2351510 |
Gene Length | 1323 bp |
Protein Length | 440 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 641279082 |
Product | transposase IS4 family protein |
Protein accession | YP_001544709 |
Protein GI | 159898462 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0000606768 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAGATCA TACCGCAGAT CAGTCACGCT ATGCACACCC TGTTAACGAC CACGACCGAG GCCATTGCCG CTGCTCAGCA GTATGTCAAA CGCCCTGACC GCGCCAAATT CTCTCCCAGT ACCCTCGTTC AAACCCTCGT CTATGGCTGG CTGGCTCAGC CAACCGCCAC GGTGGAGCAA TTGGCCCAAA TGGCCTGCCG CATCGGCGTT GCTGTCTCTC CCCAAGCGAT TGATCAACGC TTTACCATGG CCACCGCTGA CCTGCTCCAC CAGCTCGTCA TCGCCAGCAT CCACCCCGTC ATCGCCGCCA ATCCCGTGAC CCTGCCCATC CTCCAACGCT TTGCCAGCGT GCGCGTTCAT GACAGCACCT CCATTGGCTT GCCCGATGCC CTGACCGGCA TCTGGCGCGG CTGTGGCAAT GCGACAACGG GCGGCGGAGC CACCCTGAAA TGTGGTGTCC AGCTTGATGT GCTCACGGGC GCGATCACCG CCCTCGATCT GGTCAACGGA CGCGCAGCGG ATCGAGCGCT CCCGCTTCAG CAGCGCGATC TGCCGCCGGG GAGTTTACTG CTCGCCGATC GCGGGTTTTA CCACTTGGAG CGGTTGCGCC AGCACGATCA GCAGGGGGTT TTGTGGATCA CCCGCCTGCC CAGCAACGCC GTCGTGGCCT ATCCGGGACA CGCCGCGCAG CCGTTGGCCA CGTTTGTCCG CGAGCTTGGC CCGGTGGCAA CGTGGGATTG TGCGATCATC GTGGGGAAGG AGCAGCAGGT GCATGGGCGG CTGATCGTCA CGCGGGTGAC GCAGGCGGTT GCCGATCAGC GCCGGGCACG GATTCGCCAG CATGCCCAGC ACCAGCATCG GATGCCGTCG GCAGCGGCCT TGGCGCTGGC GGATTGGAAT GTGGTCTTCA CGAATGTGCC ACGGCTGCTG ATCAGCACGA CCGAGGTCTG GACCGTGATG CGGGTGCGCT GGCAAATTGA ACTGCTGTTT AAGCTCTGGA AAAGCCATGC ACGAATCGAT GACTGGCGCA CGGCGAATCC GGCACGGGTC TTGTGTGAAA TCTATGCCAA ATTGATTGGG CTGGTGTTTC AGCAGTGGCT GCTGGCCGCC AGTAGCTGGC ATGATCCGGA GCGGAGTTTG TTCAAAGCCG CGCCGATTGT CGCGGGGATG GCGGGCGAAC TGGCCAGCAC GCAGGCTGAT CCGCCGCAGT TTTTGCGGGT GTTGACACGG CTCGCGGCCT TGATTCAGCG CTGGGCGAGG ACGAACAAAC GCCAGCAGCC ACCGACCACG GCTCAGCGTT TACGGGCATT AACGGCAGCA TGA
|
Protein sequence | MEIIPQISHA MHTLLTTTTE AIAAAQQYVK RPDRAKFSPS TLVQTLVYGW LAQPTATVEQ LAQMACRIGV AVSPQAIDQR FTMATADLLH QLVIASIHPV IAANPVTLPI LQRFASVRVH DSTSIGLPDA LTGIWRGCGN ATTGGGATLK CGVQLDVLTG AITALDLVNG RAADRALPLQ QRDLPPGSLL LADRGFYHLE RLRQHDQQGV LWITRLPSNA VVAYPGHAAQ PLATFVRELG PVATWDCAII VGKEQQVHGR LIVTRVTQAV ADQRRARIRQ HAQHQHRMPS AAALALADWN VVFTNVPRLL ISTTEVWTVM RVRWQIELLF KLWKSHARID DWRTANPARV LCEIYAKLIG LVFQQWLLAA SSWHDPERSL FKAAPIVAGM AGELASTQAD PPQFLRVLTR LAALIQRWAR TNKRQQPPTT AQRLRALTAA
|
| |