Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1324 |
Symbol | |
ID | 5733216 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 1534772 |
End bp | 1536100 |
Gene Length | 1329 bp |
Protein Length | 442 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 641278462 |
Product | transposase IS4 family protein |
Protein accession | YP_001544097 |
Protein GI | 159897850 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCATCA TACCGCACGT CAGTGACGCG ATGCAAACCG TACTGACCAC GACCACCGAA ACCGTCGCGG CAACCCTCGG CTATGTGAAG CGGGCCGATC GAGCGACCTT CACGCCGAGC ACCTTGGTGC AGACCCTCGT CTATGGCTGG TTGGCGAACC CAACCGCCAG TTTAGGCCAA TTGGCTCAGA TGGCGGCACG GGTAGGCGCG ACGGTCTCAC CGCAAGCGAT TGATCGCCGC TTTACGCTGG CCACCGTCGA TCTGCTCCAT CACGTGTTAC TGGCCAGTAT GGAGTATGCG ATCAGTGCCG ATCCCGTGGC GGTGTCCATC CTCCAACGGT TCACCAGCGT GCGTATTCAT GACAGCACGA CGATCGGCCT GCCTGATGCG CTGGCGACCA CGTATCGTGG CTGTGGCAAT GCTTCGGCAC GCGGGACGGC AGGCTTGAAA TGTGGTGTCC AGCTCGATCT CCTCACGGGA ACCCTGTGTG GGATCGACCT CACGGACGGA CGAGCTTCGG ATCAGGTGTT ATCGGTTCAA CGTGCCCCGC TGCCTGCTGG GAGCCTTCGG CTGGCGGACC TCGGCTTCTA CAACATCCGT ATCTTTCGTG AGCTTGCTGC CGCCGAGGTA TATTGGCTGT CACGCGTCCA GAGTCACAGT CGGATTCGGC TGCCAGGACA GAAAGAACAG TCAATTCTGG AGGTCGTGAC GGGGTTGGGG GATGCGGATC ACTGGGAAGG GACGGTGCTG GTGGGAAGCA AGGAGCGACT CGCGGCCCGC TTATTGGTGC AACGCGTCCC CGATGCCGTG GCGGCACAAC GTCGCCAGCG GGTACAAGAC GAGGCGCATG ACAAGTGCCG CCCAGTCTCC AACGCTGCCA TGGATCTGGC GGCATGGACG GTGGTTATCA CGAACGCGCC GGAAGATAAG CTCGGCCTCA CCGAGGCCAT GGTCTTACTG AAAATGCGCT GGCAGATCGA GCTGTTGTTT AAATTGTGGA AGAGTCATGG CCACGTAGAT GAATGGCGAA CGAAAAAACC TGCCCGGATT TTGTGCGAAA TCTATGCGAA ATTGCTTGGA CTGGTGTTCC AGCAGTGGAT TCTGGTGGCA AGTGCGTGGG ATACGGCGGA ACGCAGTCTG TTCAAAGCGG CGCAAATCGT GATGGCCTAT GCGACTGATC TGGCGAGCAG TCGCGGATGT CGTGAGCAAT TGGAGACGGT GCTCACGACC CTTGCGAGCA TCATTGGGCG GTTGGCGCGG GTGCAGAAAC GTCAAAAACG GCCATCGACA GCGCAGCGGT TGTTGGCCTT AACTGCTGGG AGCGGCTAA
|
Protein sequence | MSIIPHVSDA MQTVLTTTTE TVAATLGYVK RADRATFTPS TLVQTLVYGW LANPTASLGQ LAQMAARVGA TVSPQAIDRR FTLATVDLLH HVLLASMEYA ISADPVAVSI LQRFTSVRIH DSTTIGLPDA LATTYRGCGN ASARGTAGLK CGVQLDLLTG TLCGIDLTDG RASDQVLSVQ RAPLPAGSLR LADLGFYNIR IFRELAAAEV YWLSRVQSHS RIRLPGQKEQ SILEVVTGLG DADHWEGTVL VGSKERLAAR LLVQRVPDAV AAQRRQRVQD EAHDKCRPVS NAAMDLAAWT VVITNAPEDK LGLTEAMVLL KMRWQIELLF KLWKSHGHVD EWRTKKPARI LCEIYAKLLG LVFQQWILVA SAWDTAERSL FKAAQIVMAY ATDLASSRGC REQLETVLTT LASIIGRLAR VQKRQKRPST AQRLLALTAG SG
|
| |