Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hlac_0407 |
Symbol | |
ID | 7401024 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012029 |
Strand | + |
Start bp | 423835 |
End bp | 425229 |
Gene Length | 1395 bp |
Protein Length | 464 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 643707471 |
Product | protein of unknown function DUF21 |
Protein accession | YP_002565080 |
Protein GI | 222478843 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.426667 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCTTGT CGTCTAGCGT GCCGGTGGCC GACCTCCTCC AGATACCGAT GCCCGGCGAC GGCGTCGTGC TCGCTCTCGG CGTCGCCGCG ATCCTCTTTT TGATCGGGCT GTCGGCGTTC TTCTCCTCGT CGGAGATCGC GATGTTCTCG CTACCGCAAC ACCGCGTCGA CAGCCTCGTC GACGAGGGCG TAAAAGGGGC AGAGACGATC CGCGGCATGA AACAGAACCC CCATCGCCTG TTGGTGACGA TCCTCGTCGG CAACAACATC GTCAACGTGG CGATGACCTC CATCGCGACC GCGCTGTTCG GGATCTACCT CTCGCGGGGG GAGTCGGTGC TGGCGACGAC GTTCGGCATC ACGACGCTCG TGTTGATCTT CGGCGAGAGC GCGCCGAAGT CGTACGCCGT CGAGAACACC GAGTCGTGGG CGCTCCGGAT CGCCCGCCCG CTGAAGCTCT CCGAGTACGC GTTGTACCCG CTCGTCGTCC TCTTCGATTA CATCGTCAAG GGTATCAACA AGATCATCGG TGGCTCGGCC GCCATCGAGT CGACGTACGT CACCCGTGAC GAGATCCAAG ACATCATCGA GACGGGCGAA CGCGAGGGCG TCATCGAGGA GGAGGAACGC GAGATGCTCG ACCGCATCTT CCGATTCAAC AACACCATCG CCAAGGAGGT GATGACGCCC CGTCTCGACG TCACCGCGGT GGCGAAGGAG TCCTCGGTCG AGGAGGCGAT CGAGACGTGC ATCCAAGCGG ACCACGAGCG CGTCCCCGTC TACGAGGGGA ACCTCGACAA CATCATCGGC GTGGTGACCG TCCGGGATCT CGTCCGCGAA CTGCGCTACT CCGAGGGTGA GCCGTCGCTG GAGCGCGTCG TGAAGCCGAC GCTGCACGTC CCCGAGTCGA AGAACGCGGA CGAGCTGCTC GCGGAGATGC AGGACAACCG CCTCCAGATG GTCACCGTCA TCGACGAGTT CGGGACCACG GAGGGGATCA TCACCTTAGA GGACATGGTC GAGGAGATCG TCGGCGAGAT CTTGGAGGGC GACGAGGAGG CTCCGGTGGA GTTCTTAGAA GACAACGTCG CCGTCGTGCA GGGCGAGGTA AACATCGACG AGGTCAACGA GATGCTCGGG ATCGACCTCC CCGAGGGCGA GGAGTTCGAG ACGCTCGCCG GCTTCGTGTT CAACCGCGCC GGGCGCCTCG TCGAGGAGGG CGAGGAGATC GAGTTCGACG AGATCCGGAT CCGGATCGAG CGCGTGGACA ACACCCGGAT CATGTCCGCG CGGGTCACCG TGCTCGACGG CGCGGAGGCG GCCGACGTGG TCGCCGAGGA CGACGCGCTC GAGTCGAGCG GCGAGCCCGA GGCGCCTCCG AACGACGCGG AGTGA
|
Protein sequence | MGLSSSVPVA DLLQIPMPGD GVVLALGVAA ILFLIGLSAF FSSSEIAMFS LPQHRVDSLV DEGVKGAETI RGMKQNPHRL LVTILVGNNI VNVAMTSIAT ALFGIYLSRG ESVLATTFGI TTLVLIFGES APKSYAVENT ESWALRIARP LKLSEYALYP LVVLFDYIVK GINKIIGGSA AIESTYVTRD EIQDIIETGE REGVIEEEER EMLDRIFRFN NTIAKEVMTP RLDVTAVAKE SSVEEAIETC IQADHERVPV YEGNLDNIIG VVTVRDLVRE LRYSEGEPSL ERVVKPTLHV PESKNADELL AEMQDNRLQM VTVIDEFGTT EGIITLEDMV EEIVGEILEG DEEAPVEFLE DNVAVVQGEV NIDEVNEMLG IDLPEGEEFE TLAGFVFNRA GRLVEEGEEI EFDEIRIRIE RVDNTRIMSA RVTVLDGAEA ADVVAEDDAL ESSGEPEAPP NDAE
|
| |