Gene Information |
Locus tag | Hlac_3545 |
Symbol | |
ID | 7402388 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012030 |
Strand | - |
Start bp | 293252 |
End bp | 294487 |
Gene Length | 1236 bp |
Protein Length | 411 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 643710083 |
Product | CBS domain containing protein |
Protein accession | YP_002567649 |
Protein GI | 222481413 |
COG category | [R] General function prediction only |
COG ID | [COG0517] FOG: CBS domain |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACATCT CGGAAATACT CTCGCCGAAG TTCACCGAGT TCGATATCGG AACCCCGCTT TCCAAAGTCG CCGGGGCGTT CGAAAATCAG GAACTCGACG CTGTCGTGGT GACAGACGGT GACGAGTATC GCGGCGTCGT TAGCCGCCGA CAGTTGGCCT CGTCGTCCAA CCAGCCGTCG GCGAAAGTTG GATCACGGGT ACAGCACGTC CCGACGGTTA ATCGGACTGC AGACGTCCGA GAGGTCGCCC GGCTCATGAT CGGCAGCGGC GCCAAAACGC TTCCCGTGTT GGACGATGAT CGCGTCGTCG GTATCGTCAC GGGTGACAGC ATCCTTGAGG CGGTTCAATC CTTCCTCAGC GCCGTAACGG TCGCGGATGC ATACACGGAG AAGCTGATCA GCGCGGCCCC CGACACGACG ATCGGTAAAG CCCTCAACAC ACTTCGAGAA GGTCGGATCG CACACCTCCC GGTCGTCGAC GACGGCGAAG CAGTTGGGAT GGTGAGCCTG TACGACATCG TCGATTTCAC GACGCGGGGC GGGACCAAGA GCCAGGGCGG CTCACCGGGA AACTTTGGGG GCCGTCACGG CGGTGAGCGA CACGGTGGCC TCGGAGCGCG TGAGGGAGAT TCCGATCGCA TGCTCGATCT ACCGGTGCGG AACCTGATGT CCGACGTCGT CGTCACGGTT CGACGAAGTG CCCCACTTGA CGAGGTCGTC GAAACGATGT TCGAGCGAGA GATCTCCTCG TTGGTGGTTC TCGGAGATCA GAGTAGTGAA CCCATTGGCG TAGTCACGAA AACAGACGTC CTCGAGGCGC TCACCTGGGA ACAGGAAGAC CGGAATGCAG TGCAGGTGTT CGGACTGGAC CTGCTGGATG GAATGGACTA TGATGGTGTC TCTGCGCTGA TCGAAAACAT GACCTCAAAG TATGGAGACA TGAGTGTGAT CAAAGCCAGC ATCGAGCTCC ATGAACACAA AGAACAGTCT CGAGGTATGC CGCTGGTGCT CGCACGGATT CGGCTGGTGA CTGACCGCGG CTATTTCACA GCCGATGGGG AGGGATACGG TGCCTCGCAC GCTCTCAGAC TGGCCGCGAA CAAAGTCGAG CGGCAAATAC TCAAGGGGAA AACATACCGG CGGTCGAAAA AGCGTCCTGA CAGCCGAGAG CAAGAGACCC TCTATGGGTG GTGGCTCACG GGTGATAGTT TCGATTTTCC GGAAGGAAAT GAGTGA
|
Protein sequence | MNISEILSPK FTEFDIGTPL SKVAGAFENQ ELDAVVVTDG DEYRGVVSRR QLASSSNQPS AKVGSRVQHV PTVNRTADVR EVARLMIGSG AKTLPVLDDD RVVGIVTGDS ILEAVQSFLS AVTVADAYTE KLISAAPDTT IGKALNTLRE GRIAHLPVVD DGEAVGMVSL YDIVDFTTRG GTKSQGGSPG NFGGRHGGER HGGLGAREGD SDRMLDLPVR NLMSDVVVTV RRSAPLDEVV ETMFEREISS LVVLGDQSSE PIGVVTKTDV LEALTWEQED RNAVQVFGLD LLDGMDYDGV SALIENMTSK YGDMSVIKAS IELHEHKEQS RGMPLVLARI RLVTDRGYFT ADGEGYGASH ALRLAANKVE RQILKGKTYR RSKKRPDSRE QETLYGWWLT GDSFDFPEGN E
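
The coordinate and length fields in this record can be cross-checked against each other. The sketch below is illustrative only and is not part of the source database. It assumes that Start bp and End bp are 1-based inclusive coordinates, and that the CDS is unspliced and ends in a single stop codon, which matches the "Is gene spliced | No" field and the terminal TGA of the gene sequence. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by an unspliced CDS, excluding one stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


def gc_percent(seq: str) -> float:
    """GC content in percent; spaces between 10-base display blocks are ignored."""
    bases = seq.replace(" ", "").upper()
    if not bases:
        raise ValueError("empty sequence")
    return 100.0 * sum(b in "GC" for b in bases) / len(bases)


# Values taken from the Gene Information table of this record.
length = gene_length(293252, 294487)
print(length, protein_length(length))  # prints "1236 411"
```

Passing the Gene sequence field to gc_percent gives a value that can be compared with the GC content field, which is reported as a rounded percentage.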