Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2039 |
Symbol | |
ID | 5733928 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 2542293 |
End bp | 2543429 |
Gene Length | 1137 bp |
Protein Length | 378 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641279183 |
Product | transposase IS4 family protein |
Protein accession | YP_001544810 |
Protein GI | 159898563 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTTCAAA GAAAGGTGAT CATCATGGGG AGCAGTCATG AATTATACAC GCGGGTTTGG ACGACGTTGC ACCAGTTTCA TCCAACCCTC CATGCACGGC GACTGGCGAC CTGGGCTTGG GTCATTGTCG GCTTACTCCA TGCGCGATCC GTCCATCTTA GCGCTGTGGC GCTCCATCTG GCGAGCGATG CTGAGGCCGC TGGGCGAATC GCACGCATTC GGCGCTGGCT CGCCAATCCG TGGCTTGATA CCCAGTTTCT CTATCGTCCG CTCATTACCC ATGTGCTCAC GGCTTGGCGC AATCGCGACA TCACTATCAT GATTGACGGG TGCTACGTCA ATCACGACAA ACTCCAGATG GTTCGCCTGT CCTTATCCCA CTGTTATCGG GCAATCCCTC TCGCGTGGCA GGTCATGAGC CATCACGGGA ACGTCTCCGT GGAGTCATGT CAGCGGATGC TTAATCGGGT ACAACAACTT CTGATCGGAA CCCGTCGTGT GACGTTTCTT GCGGATCGGG GCTTTCGCGA TTGGGCATGG GCTGCAAGCT GCCAGCGCCG CGGCTGGGAT TACATCATTC GGATCGCAAA TACAACGACC ATTCGCTGGG ATGATGGCCC ATGGATGGCG ATCAACACTA TGGCAGTAAA GCCCGGCAAG TCCGTCTATC TGCGCAATGT TTTGCTCACC CAAGACGGAG AATGGCGCTG TACTATCGCC ATTACGTGGA CACGTGCCAC GAAAACCAAG CCTGCGGAAC GATGTGCGGT AATAACCAAC CGAGAGCCGA GCAAATGGAT TCTGAACCAT TATTTGCGCC GTATGCATAT CGAAGAGAGC TTCCGCGATG ACAAATCGGG CGGATTTGAT TTGGATGCCA GTCGCCTGCG CGATCCGCAG CGGCTTGATC GGCTGCTATT GGCGATCGCC GTGGCAACGC TCTGGATGTA TGAACTGGGG GAACGCGTAC TCAAGGATGA GCAACGTGCC CACGTCGATC CAGGCTATCA GCGTCAACTC AGTGTGTTTC AGCTAGGATG GCGTTGGCTC CGGCGAGCAT TGAGCCTTGC CGATATCCCG AAATGGAACC TCACGCTCCA TCCGTTTCAG CCTGAGCGGG TCGCAGCAAA GTGTTAG
|
Protein sequence | MLQRKVIIMG SSHELYTRVW TTLHQFHPTL HARRLATWAW VIVGLLHARS VHLSAVALHL ASDAEAAGRI ARIRRWLANP WLDTQFLYRP LITHVLTAWR NRDITIMIDG CYVNHDKLQM VRLSLSHCYR AIPLAWQVMS HHGNVSVESC QRMLNRVQQL LIGTRRVTFL ADRGFRDWAW AASCQRRGWD YIIRIANTTT IRWDDGPWMA INTMAVKPGK SVYLRNVLLT QDGEWRCTIA ITWTRATKTK PAERCAVITN REPSKWILNH YLRRMHIEES FRDDKSGGFD LDASRLRDPQ RLDRLLLAIA VATLWMYELG ERVLKDEQRA HVDPGYQRQL SVFQLGWRWL RRALSLADIP KWNLTLHPFQ PERVAAKC
|
| |