Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_5206 |
Symbol | |
ID | 5737164 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009973 |
Strand | - |
Start bp | 294579 |
End bp | 295616 |
Gene Length | 1038 bp |
Protein Length | 345 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 641282370 |
Product | transposase IS116/IS110/IS902 family protein |
Protein accession | YP_001547961 |
Protein GI | 159901715 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3547] Transposase and inactivated derivatives |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.0981183 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATCGAT CGCCCGTCTA TCACTGTTTT GTTGGGATCG ATATTGCCGC GAAGACCTTT GTGGCCGCGT ATGCGGAGCC AGGTCATGCA GCCAGCTCAC CGCGCACTTT TGACCAGACT GAGGCTGGAT TTGCCGCATT CCAAGCCTAC CTCCCAGCGA CAATCCTGCC CACGCAGATC CTCATCGCAA TGGAAGCCAC GGGGTCATAC TGGATTCGGC TGGCGGTTAC CCTGCATCAA GCAGGCTACG CGGTGGCCGT GATCAATCCA AAGCACATCC ACAATTTTGC CAAATCGCTC CCGCGTCGCG CGAAAACCGA TGCGCTTGAT GCCGACGTTC TTCTCCGCTT TGCGACTGAG CGCCAGCCAT CCTGTTGGAC TCCACCGCCA ACGGTCTATC ATGAACTGCG CCAACGCTTA CTTGCTCGCG ATGCCTTACT GGCCATGCGA ACCCAAGCCC GTAATCAACA GCATGCCCTC AGTCAGTGGC CCGTGGTTGT TGCGGAGGTA ACCGCCCACT TTAATACCGT CTTGGCGGCC TTGGATACAC AACTGGCGAT GCTCGAACGC GAAATCACCA CGACCATGCA GCTCGGTGAC TGGGCAGCAT CGGCCACGCT CTTGTTATCA ATCCCGGGGA TCGGTCTGCA CGCGACCGCG TGGCTGCTGG TGAGCACGCT GAATTTCACC TTGTGCGCAA CCCCGGAAGC AGCGGTAGCC TATGCTGGAT TAAATCCGCT TGCCCGCGAG TCTGGAACGA GTATTCGTGG CAGACCCCGT GTTGGTCGGG GCGGTAATGC CCGCCTACGA ACTGTCCTGT ATATGGCAAC ACTCAGTGCG AGTCGGTATA ACCCGCCAAT TCAGGCATTG TATACGCGCC TGCGGGAACG AGGAAAAGCG GTCAAGGTGG CTCGCTGTGC GGCAGCACGC AAACTGATTC ACCTTGCATG GGCTATTGTG AAGACAGGAC AACCGTTTGA TCCTGGCTAC CAGCAGAAGC TCCGTGAGCA CAGGGTCATG GTCGCTATTA CGGACTAA
|
Protein sequence | MNRSPVYHCF VGIDIAAKTF VAAYAEPGHA ASSPRTFDQT EAGFAAFQAY LPATILPTQI LIAMEATGSY WIRLAVTLHQ AGYAVAVINP KHIHNFAKSL PRRAKTDALD ADVLLRFATE RQPSCWTPPP TVYHELRQRL LARDALLAMR TQARNQQHAL SQWPVVVAEV TAHFNTVLAA LDTQLAMLER EITTTMQLGD WAASATLLLS IPGIGLHATA WLLVSTLNFT LCATPEAAVA YAGLNPLARE SGTSIRGRPR VGRGGNARLR TVLYMATLSA SRYNPPIQAL YTRLRERGKA VKVARCAAAR KLIHLAWAIV KTGQPFDPGY QQKLREHRVM VAITD
|
| |