Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hlac_0340 |
Symbol | |
ID | 7399732 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012029 |
Strand | - |
Start bp | 362377 |
End bp | 363516 |
Gene Length | 1140 bp |
Protein Length | 379 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 643707404 |
Product | protein of unknown function DUF87 |
Protein accession | YP_002565014 |
Protein GI | 222478777 |
COG category | [R] General function prediction only |
COG ID | [COG0433] Predicted ATPase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.0311557 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTATGTGC TCGGACGCGA CACCGGCTCG GATGATGGGA CCGGGTCAGC CGGCACGAAC GGCGAGATCG GCGACGAACG GCTCCCAACC GTCCAGCTCG GGTCGTTCCT CGCACGCGAC GGCAGCGCCG GGGCCGCGGT CGGGATTGAC GCCGACAGCC CGCACGCCGG CGTCGTCTTC GGTAAGCGGG GCACCGGCAA GTCGTACACA CTCGGCGTCC TTGCGGAGGG GCTCGCGGCG GCCAGCGGCG TCGCGCCGGT TGTGGTCGAT CCAATGGGCG TCTTCGACGG GCTTCGGGCG ACTGGCGGAC AGGTCGTCGA ACCACGGGTC CGCCCAGCGG CGATTCCCCC AGAGGCGTGG CCGGACCTGC TCGGGCTCGA CCCGGCGAGC GGGCCGGGAA GTCTGGTGTG GCGCGTCGTC GCTGACGCCC TCAAATCCCC TGAGGCGGGG GGTTCGGGTG AATCGGACGA ATCGCCGTCG CTCGCGACGC TCCGCGATCG AGTCGACGCC GCAGACGCGC CCGCTGCAGA TCGCCGCGCG GCCGCAAACC ACCTGCGGCT CGCGGAGTCG TGGGGCGTGT TCGACGCGGA CGCACCGCCG ACCGTCCGGC TCGTCGGTGG CGGGGAGCCG ACCGTACTCG ATCTCGCCGG CGTTCCGGAG GCAGCCGCGG CTGCAGTCGT CAGGGCGGTC GCTCGCGGGC TCTACGACGC CCGGATCGAC GGCGACCTCG ATCGGCTCCC GTGGCTCCTC GTCGACGAGG CGCACGCTTT CTTCGGCGGC GTCGCTGATC CCGCGCTCCG AACGCTCCTG ACCCGTGGTC GCGCACCCGG CGTCTCGCTG GTCTGTGCGA CGCAGCGACC CGGTGCGCTG CCGAGCGTCG CCGTCTCGCA GTCGGACCTG CTCGTCGCCC ACCGGCTCAC CGCCGAGCGC GACCTCGACC GGCTCGCCGA GGCGGAGGCG ACCTACCTCG CCGGCGACCT CGCTTCCCGG CTCCCGACTG AAACCGGCGA GGCGCTCGTC GTCGACGACG CGACGGAGAC GGCTCACACA GTTCGGATCC GAGAGCGACG GACTCCACAC GGTGGCGGAA GTCCCAGCGC AAGCGGGACC GCCGCCGCGA AGTCCGAAGA CCCAAGATAA
|
Protein sequence | MYVLGRDTGS DDGTGSAGTN GEIGDERLPT VQLGSFLARD GSAGAAVGID ADSPHAGVVF GKRGTGKSYT LGVLAEGLAA ASGVAPVVVD PMGVFDGLRA TGGQVVEPRV RPAAIPPEAW PDLLGLDPAS GPGSLVWRVV ADALKSPEAG GSGESDESPS LATLRDRVDA ADAPAADRRA AANHLRLAES WGVFDADAPP TVRLVGGGEP TVLDLAGVPE AAAAAVVRAV ARGLYDARID GDLDRLPWLL VDEAHAFFGG VADPALRTLL TRGRAPGVSL VCATQRPGAL PSVAVSQSDL LVAHRLTAER DLDRLAEAEA TYLAGDLASR LPTETGEALV VDDATETAHT VRIRERRTPH GGGSPSASGT AAAKSEDPR
|
| |