Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hlac_1212 |
Symbol | |
ID | 7399480 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012029 |
Strand | - |
Start bp | 1222965 |
End bp | 1223963 |
Gene Length | 999 bp |
Protein Length | 332 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 643708278 |
Product | transposase IS4 family protein |
Protein accession | YP_002565876 |
Protein GI | 222479639 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0000000262262 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.00000730642 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGTCTACGA GCGCCAGCAT CCTGCAAGAG GAGACTTCTA TCGACGAGTT CTTCAATGTA ATGGCGACCG AGACGCTCGC GTTGTTCGAG CATCTTGAGT TCGACTTTCT CGAAGAATTC GATGTGTTCG CCCCCGCTCG CCGGGGGCGA ACACGAGATC ATCACCCACC AGCACTCTTC CGAGCGTTCC TGCACTGCTA CTACAAGAAC GTCTACGGCA TCCGTCCAGT CACGCGAGAA CTCCAGAACA CGGTCGTCTG GCTCAGCTGT GGCTTCGATC GACCGCCGTC GAGAGACGCG GTCGATCGCT TCCTCACCGA CCTCGAACAC GTCGTCGACG AGGTCTTCGA CCGCCTCGTC GAGCAGGCCG CCTGCCGCGG CCTGCTCGAC TTGACCTACT CCATCGATTC CACCGACGTG AGGACGATGC CCGCCGACCA AGACGCGTCG AAAGGCTACG ATCCAACCGC CGAAGAGTAC TACCACGGCT ACGGCTGTAC GATCGTCTCG ACCGGGCAAA AGATCCCGAT TGCCGCGGAG TTCACCGAGA GCAAGCAAGC GCCAGAGGAG ACGGCGATGC GCGTCACGTG TGACGCGCTC GCCGTCGAGA AACCGATCTG GATGCTTGGA GACAGCGCCT ACGACACGCT CGGCTGGCAC GACCACCTGC TGGCCGCAGG GGTCGTGCCA GTCGCTCCGT ACAACGCACG AAACACCGAC GATCCGAAAG ACATCGAGTA CAGGGTCGAA GCCCGCATCG ACGAACACAG CGAGGACGTT CAGCTGAAGC AATCGACGCT AGACGAGACG TACAACCGCC GGAGTGGAGT CGAACGAACC AACGACGCCG TCAAGGACTG CGGCCTCGGG CACGTTCGCG CCCGAGGCCG CGTCCACGCA CGAGCACAAG TGTTCCTCGC GCTGTGCCTT CGTCTCGTTA TTGCGATCAC CAACGACGAA CGCGGAGACA ATCCAGGAAG CACCGTCATC ACGCTATGA
|
Protein sequence | MSTSASILQE ETSIDEFFNV MATETLALFE HLEFDFLEEF DVFAPARRGR TRDHHPPALF RAFLHCYYKN VYGIRPVTRE LQNTVVWLSC GFDRPPSRDA VDRFLTDLEH VVDEVFDRLV EQAACRGLLD LTYSIDSTDV RTMPADQDAS KGYDPTAEEY YHGYGCTIVS TGQKIPIAAE FTESKQAPEE TAMRVTCDAL AVEKPIWMLG DSAYDTLGWH DHLLAAGVVP VAPYNARNTD DPKDIEYRVE ARIDEHSEDV QLKQSTLDET YNRRSGVERT NDAVKDCGLG HVRARGRVHA RAQVFLALCL RLVIAITNDE RGDNPGSTVI TL
|
| |