Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_5219 |
Symbol | |
ID | 5737177 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009973 |
Strand | + |
Start bp | 318880 |
End bp | 320592 |
Gene Length | 1713 bp |
Protein Length | 570 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 641282383 |
Product | transposase Tn3 family protein |
Protein accession | YP_001547974 |
Protein GI | 159901728 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACGAT TCTGGGATGT CGATGAGCTT GCCGCACATT TTACCCTTAC GGAGGAGGAT TGGACGCTCA TCGCCAACAA AACGGATGCA ACCCGCCTCG GCTTTGCCAT TTTGCTCAAA TACCTTCCCG TCGAAGGAGC CTTTCCCAAA CACGCGGCTG ATGTGCCGCC CATGGTCATT GACCACCTTG CCACGCAAGT CGGCGTGGCG GCTGATAAAT TTGGAACCTA TGCCTTCAAA GGCCGGACGA TTGAGGAGCA TCGGAGCCAG ATTCGCCTCG CGCTCGGCTA TCGCCCATCG ACCGAGCAGG ATCTCACGGA GCTGAGCGCA TGGTTGGTGG CCCAATGCCC GCCGGAAGAT CACTACACCA CAGCGGCAAA AGCGGCTGCC GTAGCACACT TGCATGCCCG TCATATTGAA CCACCAAGCC CGAAACAACT GCGCCGAGCC GTGAAATCCG CAGGTCGGAT GGCCGAGGAA CGGATGGCTG CGGGGGTTGC TCATCGCTTG ACTTCCTCGG TGAAGCAGGC ACTTGACGCA CTCCTCCAGA CCACGAGCCA TCCAGCCACC GCCGATGGGG GTGACGATGA TCCCGACGAC GATGAACCGA CCGCAGCGAC CCCGCTCGCA GAACGCCGGA CCGCATTGTT GACCGAACTG AAATATGATG CCGGTCGTGC CAGTCTCAAA AGTATCCTCC GTGAAATCGA TCGGTTGCAG ATCATCCGCA CCGTGCAGCT CCCACCACTC CCCGTCGATG GACTCCATCA CACCATTGTT CAAGCGCTCC GGCAACGCGT CGTGACGGAA GAACTGTTTG AACTCCGTCG CCATTCTGAC AATGTGCGGG CCTACATGTT AACTGCGTTT TGTTGGCTCC GAGGCCAAGA GATAACCGAT AACCTGATCG AACAACTCAA TCAGATTGTC TATAAAATTG GGGCCAAGGC CGAGAAAAAA GCGGATGACG CATTGGTCAA CGCGATCAAA CGGGTGCGTG GCAAGACCAC GATTCTCCGC AAGGTGGCCC AGCAATCGTT GGAACGACCA CACGATGATG TCGAGCAGGT CGTCTATCCA GCGGCGGGCG GCAAACATGT CCTTGAGGCG CTCGTGACCG AACTCACGGC CATTGATACC TATGACGATC ATGTCCATCA AACTATTCGG AGTTCCTACG CCAACCACTA TCGGCGGATG ATCCCGCCCA TCTTGCGCAT GCTGACCTTT CGCTCTAATA ACGAGGCCTA TCGGCCTGTC CTCCAGGCGA TTGACCTCCT CAAGGCCTAT GCGGATGTCG CTGGTGACTA TCCCTACTTT GCCGATGATG CCGAGATCCC CCTCATCGGG GTTGTTCCCG ATGCTTGGAT GCCCTTGGTG CAGAATGAAC ACGGTCGCGT CAACCGCATC GCCTATGAAA TCTGTGCGCT CCAAGCGCTT CGCGATCGGC TGCGTTGCAA GGAAATTTGG GTTGAGGGTG CACTCCGCTA TCGCAATCCC GACGAGGATC TGCCGAAGGA TTTTGACCAA CGCCGGGAGG AATACTATCA CGCGCTCGAC CAGCCCCTTG ATGCCGATGC CTTTATCGAG ACGCTGCAAG CCGAGATGCG ATTGTGGTTA AAAACCCTTG ATCGGGGATT ACCCAAAAAT GCGGCGGTGC GCATCTCGAA GCAACGCGGG CATTGGATTC ATCTGACCAC ACTAAGCCAA TAG
|
Protein sequence | MKRFWDVDEL AAHFTLTEED WTLIANKTDA TRLGFAILLK YLPVEGAFPK HAADVPPMVI DHLATQVGVA ADKFGTYAFK GRTIEEHRSQ IRLALGYRPS TEQDLTELSA WLVAQCPPED HYTTAAKAAA VAHLHARHIE PPSPKQLRRA VKSAGRMAEE RMAAGVAHRL TSSVKQALDA LLQTTSHPAT ADGGDDDPDD DEPTAATPLA ERRTALLTEL KYDAGRASLK SILREIDRLQ IIRTVQLPPL PVDGLHHTIV QALRQRVVTE ELFELRRHSD NVRAYMLTAF CWLRGQEITD NLIEQLNQIV YKIGAKAEKK ADDALVNAIK RVRGKTTILR KVAQQSLERP HDDVEQVVYP AAGGKHVLEA LVTELTAIDT YDDHVHQTIR SSYANHYRRM IPPILRMLTF RSNNEAYRPV LQAIDLLKAY ADVAGDYPYF ADDAEIPLIG VVPDAWMPLV QNEHGRVNRI AYEICALQAL RDRLRCKEIW VEGALRYRNP DEDLPKDFDQ RREEYYHALD QPLDADAFIE TLQAEMRLWL KTLDRGLPKN AAVRISKQRG HWIHLTTLSQ
|
| |