Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hlac_2682 |
Symbol | |
ID | 7400889 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012029 |
Strand | - |
Start bp | 2669892 |
End bp | 2671721 |
Gene Length | 1830 bp |
Protein Length | 609 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643709756 |
Product | conserved repeat domain protein |
Protein accession | YP_002567323 |
Protein GI | 222481086 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1361] S-layer domain |
TIGRFAM ID | [TIGR01451] conserved repeat domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.549235 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCGTA TCCGGAAGAT CGCGGTCGTC GCGCTTGTCT GCTGTCTGCT AGTGGCCGGC ACCGCGCTGG TCGCGGCCAA CGACACGGTC AGCGGCGATC CCGACATCGA GGCGTTCGCG CCCGAGACGG CGTTCGTTCC CGGCGAGGAG TCGACGCTGC AGGTGAGTCT GAACAACCGC GGCGACGTGA GCGAGGAGGG GTTCGACGAC CTCGAAAGCG AGGTGGTGAC CGCCCACGAG ACCACCGCGC GGATCCTCTC CGGCGACGAG ACCAACCGCG ACGTTCCGTT CGACGTGCGC ACCGGCGAAC AGACGGTGGG CGACGTGCCT CGGGGCGTCA CCGGTCCGAT CGACTTCACC GTCGTCCCCG ACGAGGACGC GGAACCGGGC GTCTACCAGG TCCCGATCCG ACTGGAGTAC CGGAACGTCT ACAACGCCGA GGACGACAAC GGCGCGACCC TCCGCGACGA GCGCGTCGAG ACCGAGACCG TGGTCGTCGA CGTCGAGATA ACCGACCGCG CGCAGTTCGC GGTCACCGCC GTCGACGGCG CGGTCCAAGC CGGCGACACC GGCGTCGTCG ACGTGACGAT GCGAAACGTC CAGAACGAAA CCGCCCGCGA GGCGTCGGTG GCGGCGACCC CGGTCGACCC TGATCTCACC TTCACCACCG AGGCCGGCAC CACCGAGACG TACGTCGACG ACTGGGCGCC CGGCGAGAAC CGCACCTTCA CCTACCGGTT CGACGCCGCC GCCGACGCGA CGCCACGGGC TTCGACGCTG GAGTTCGACG TGGAGTACCG AGACGCCGAG CGTGCGGACG CTACCGCCCG AACGGTTCGG ACCGGCGTGA CCCCGCTCTC GCGGCAGGCG TTCGACGTGA CCGGCCTCAA CAGTTCGCTG GAGGTCGGCA AGGACGGCTC GTTCACGGTC GCGGTCCGCA ACGACGGCCC GCGTCCGGTG GAGAACGCCG TCGTCGCCTT CGACAACGAG GCGCCGGCGC CGGAGGGGGT CGGGGCGGAC ACGATTCCGA CCGACGAGAA CGTCGTCCCC CGCGAGACGC GGGTGACCGT CGGCGACCTC GGGGTCGGCG AGACCGCGAC GGCGACGTTC GACGCCGGGA TCCGCACCGA CGCAACCCCG GGCAATCGGA CGCTCAACCT CGTCGTGCGC TATCGCGGGC TCGACGACGA CGTGGTCGTC TCCGACGCGT ACGACGCCGT CGTCGATGTG CGCCCGGAAC AGGAGACCTT CGCCGTCTCA CCGGTCGAAC CGCGTGTCGA CGAGGACGGT GCGGGCGGTA ACGAGACCGG CAGTGACGGT AACGGGACTG ACGAGGGCGA CAACAGAACC GCCGCCGTCG CCGGCGCCGC GACCGGAATC GCGCCGGGTG AGACCGCCCG GTACGACGTG GTCGTTCGTA ACACGGGCTC CGAGCCGGTC TCGGACGTGC AGGCGAAGCT GTTCGTGGAC GAGCCGATCT CCTCGGACGA CGACGAGGCG TTCGTCACGT CGCTGGATCC GGGCGAGGAG ACGACCCTGC GCTTCGAAGT AAGCGCTGAC GGCGGGGCCG CGCCGAAGAC GTACTCCGCC GCGGTCGACT TCCGGTACGA CGACGCCGAG GGCGACGAGC AGCTCTCGGA CACCTACCGG CTCCCCGTCG AGGTCGCCAC CGACGACGGT GGCAGCGTGC TCCGGTCACC GGTCGGAATC GTCGCCCTGC TGAGCGCCCT TGCGATCGGC GCCGCGGCGG TCTGGAAGCG CGGCCGGGTG AGCCGAGCGA TCTCGCGGTT CCGCCGGCGG GCCCGCGACC GGTTCGGCGG CAACCGGTAG
|
Protein sequence | MSRIRKIAVV ALVCCLLVAG TALVAANDTV SGDPDIEAFA PETAFVPGEE STLQVSLNNR GDVSEEGFDD LESEVVTAHE TTARILSGDE TNRDVPFDVR TGEQTVGDVP RGVTGPIDFT VVPDEDAEPG VYQVPIRLEY RNVYNAEDDN GATLRDERVE TETVVVDVEI TDRAQFAVTA VDGAVQAGDT GVVDVTMRNV QNETAREASV AATPVDPDLT FTTEAGTTET YVDDWAPGEN RTFTYRFDAA ADATPRASTL EFDVEYRDAE RADATARTVR TGVTPLSRQA FDVTGLNSSL EVGKDGSFTV AVRNDGPRPV ENAVVAFDNE APAPEGVGAD TIPTDENVVP RETRVTVGDL GVGETATATF DAGIRTDATP GNRTLNLVVR YRGLDDDVVV SDAYDAVVDV RPEQETFAVS PVEPRVDEDG AGGNETGSDG NGTDEGDNRT AAVAGAATGI APGETARYDV VVRNTGSEPV SDVQAKLFVD EPISSDDDEA FVTSLDPGEE TTLRFEVSAD GGAAPKTYSA AVDFRYDDAE GDEQLSDTYR LPVEVATDDG GSVLRSPVGI VALLSALAIG AAAVWKRGRV SRAISRFRRR ARDRFGGNR
|
| |