Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3831 |
Symbol | |
ID | 5735695 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 4810330 |
End bp | 4811574 |
Gene Length | 1245 bp |
Protein Length | 414 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641280983 |
Product | transposase IS4 family protein |
Protein accession | YP_001546595 |
Protein GI | 159900348 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0000386672 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGTTCAC ACAACGACCT CGGGTTGGTA TACTGGGTGG TGACGAAACC AAACCCATCC CAAAACCGAG GTCGTATGCC GAGTATACCA GCCCTTGCCC ACTGGATCAA CACCATCCTG CGCACCGCCG TGCCCACCCT CTCGCCCTGG ACTGCCCGTC GGCTCACCGA TTGGCTCGTC AGCATCCTGC TCATGCCGTC CATCACCACG CGCGTCGTGG CCTGGGGCTG TGCCCTTGGA CTGTCCACCG CTGCCCACGC CGCCAGTCAC GAACGCCGAC TGCGCCGCAC CTATCGGGAT TCCCAGCTGT CGTGGTCGCT CCATCGCGCC ATCCTCGCCA CCACCCTCCA CATCGCACCC ACTGAATCCG TCACCGTTAT CATCGATGAA ACCACCCACA CCGACCGCTG GACCCTGCTC ACCGCCGCCC TCTGGTATCA CGGTCGCGCC ATTCCGCTTG CCTGGGTGCT CCATCCCGGC TATACCCGCC GCGCCACCGC CTTCTGGACC GATGTTGCCA CCCTGCTGGA GCGCGTGCAG CAGGTGCTGC CCAATGCCAT GTCCGTCGTC GTCGTGGCCG ACCGTGCTTT TGGCTGCCCC GCCTTCACCG ATCAGGTCGC GGCCTACGGC TGGGGCTGGG TCGTGCGCGT CCAAGGCCAT ACCCGCATCC AACTGCGGGG GCACACCGAA ACCATGATCC GCACGCTGGT CACGCGAGGC CATCGCGTGG TGCGGCGGGG TCATGCCTTC AAGAAGGCGG GCTGGCGAAC GGTGACAGTG GTGGCCGCAT GGGAGGCGAC GTGTCACGAG CCGTTACTGC TGGTGAGCAA TCTGGAGGGC ATTGGGGCGA TTCGGCAGGC GTATGGGCGG CGCTCTGCGA TTGAGGCCCT GTTTCGCGAT TGGAAAACGG CGGGCTGGCA ATGGGAGGCG AGCCAGTCGC GGAGCCAGAC GACGCAGGAG GCCTTGGTGC TGGGCATGGC GATCGCGACG GTGCTGGTGC TGCTGGTCGG GACGGCGGAG GCGCAGGCGG TGCTGGCCGA ACGCGGGGAT CGCCCCAGCC CGCGCCGCCC ATGGGCGGCA CGAGAAAGTC TGTTTCGGTT GGGGCGGTAT GGGGTGCTGC GCTGGCTGTG GACGGGAACG CAGCCAGCGC TGGGAGCGCG ACTATCGTTG GCGGGAACGG CGCTGCACGA ACGGTGGGCC ACGACGGTGA CGCGGGGTGG TCGGCTCGGG ACGGCCATCC CCTAA
|
Protein sequence | MGSHNDLGLV YWVVTKPNPS QNRGRMPSIP ALAHWINTIL RTAVPTLSPW TARRLTDWLV SILLMPSITT RVVAWGCALG LSTAAHAASH ERRLRRTYRD SQLSWSLHRA ILATTLHIAP TESVTVIIDE TTHTDRWTLL TAALWYHGRA IPLAWVLHPG YTRRATAFWT DVATLLERVQ QVLPNAMSVV VVADRAFGCP AFTDQVAAYG WGWVVRVQGH TRIQLRGHTE TMIRTLVTRG HRVVRRGHAF KKAGWRTVTV VAAWEATCHE PLLLVSNLEG IGAIRQAYGR RSAIEALFRD WKTAGWQWEA SQSRSQTTQE ALVLGMAIAT VLVLLVGTAE AQAVLAERGD RPSPRRPWAA RESLFRLGRY GVLRWLWTGT QPALGARLSL AGTALHERWA TTVTRGGRLG TAIP
|
| |