Gene Information |
Locus tag | Hlac_0042 |
Symbol | |
ID | 7401395 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012029 |
Strand | + |
Start bp | 44396 |
End bp | 45406 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643707101 |
Product | hypothetical protein |
Protein accession | YP_002564718 |
Protein GI | 222478481 |
COG category | [S] Function unknown |
COG ID | [COG2339] Predicted membrane protein |
TIGRFAM ID | |
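The Start bp, End bp, Gene Length, and Protein Length fields above are related by simple arithmetic. The following Python sketch checks them against each other, assuming 1-based inclusive coordinates and an unspliced CDS that ends in a single stop codon. The function names are illustrative and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(44396, 45406)
print(length, protein_length_aa(length))  # prints: 1011 336
```

Both results agree with the Gene Length (1011 bp) and Protein Length (336 aa) fields listed in this record.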
| |
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.0273863 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCATCTC GGTCGGATCC GGTCGAGTCG CGCGCCGACG ACGACCGCGA CCTCTACGAC ATCGCCACGT GGGAGGAGCG TACCTCCCTC GACGGGCTCT CCGTCGCGCT GCACTGGCTC ATTACTCGGT CGGCGAAGGC GATCGTCGTC TTCGTCGCGC TCCTCGGGCT CCTCGCGATC CTCGGATCCT TCGGGCTCGG ACTCGTCTTC GACCCGGCGG TCGGGATGCT CGTCGGGCTC TCAGCGATCC CGGCGCTCGG GCTCGCAGCG TACGTGTACG TCTCCGACGT GACCACCGGA GAGCCGCTCT CCCTGCTGGT GGCGACGTTC CTCCTGTCGA TCCTGACCGC GACGTTCGCG GCACTCCTCA ACAGCGTCGC GCAGCCGTAC TTCCAGCCGT TCGGATTCCC CGGGCTCGTC CTCTTCTTCT TCGCGATCGT CGGTCCGATC GAGGAGTCGG TGAAGCTACT CGCGGTTCGG CTGTACGCGT ACACCGACGA CCGGTTCGAC GCGGTCATCG ACGGGGCGGT GTACGGCGCG ATCGCCGGGC TGGGGTTCGT CGTCATCGAG AACCTCGTGT ACATCGCGCA GACCGTCGAT CTGGGGGAGC TTTCGCTCAG TATCGCCACG CTCGGTGCGG GCGACGGGAT CGCCGCGCTG CGCGCGCTCG CCGGCCCCGG ACACGTCATC TACTCCGCGT TCGCGGGATA CTACCTCGGG CTCGCCAAAT TCAACCCTGG GAACCGGGGA CCGATCGTCG TGAAGGGGCT TATCATCGCT GCCGCGATCC ACGCGCTGTA CAACACGTTG GTCGGGCCAG TGACGACGGT GCTGTCGGTC GCGACCGGGC TCCCGCAGCT TGTCTCCCTG TTCGTCTTCG TGCTCCTGTT TCAGGGCGCA TTCGCGTACG TTCTCCTGCG GAAGCTCCGC CGATACCGAG ACGCGTACCT CGAGACGCGC GACGCGGTCG ATCCGGATGT CAAGCCCGAA ATGACCGAGT TCGAGGACTA A
|
Protein sequence | MPSRSDPVES RADDDRDLYD IATWEERTSL DGLSVALHWL ITRSAKAIVV FVALLGLLAI LGSFGLGLVF DPAVGMLVGL SAIPALGLAA YVYVSDVTTG EPLSLLVATF LLSILTATFA ALLNSVAQPY FQPFGFPGLV LFFFAIVGPI EESVKLLAVR LYAYTDDRFD AVIDGAVYGA IAGLGFVVIE NLVYIAQTVD LGELSLSIAT LGAGDGIAAL RALAGPGHVI YSAFAGYYLG LAKFNPGNRG PIVVKGLIIA AAIHALYNTL VGPVTTVLSV ATGLPQLVSL FVFVLLFQGA FAYVLLRKLR RYRDAYLETR DAVDPDVKPE MTEFED
|
| |
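The sequences above are displayed in space-separated blocks of ten characters. The following Python sketch shows one way to strip that grouping and compute a GC percentage comparable to the GC content field. The helper names are hypothetical, and the example input is only the first two blocks of the gene sequence, not the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two ten-base blocks of the gene sequence in this record.
example = "ATGCCATCTC GGTCGGATCC"
print(len(clean_sequence(example)), gc_percent(example))
```

Passing the complete Gene sequence value to `gc_percent` should give a figure that can be compared with the GC content field; the result for the full gene is not asserted here.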