Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hlac_1834 |
Symbol | |
ID | 7400026 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012029 |
Strand | + |
Start bp | 1843371 |
End bp | 1844492 |
Gene Length | 1122 bp |
Protein Length | 373 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643708904 |
Product | cobalamin synthesis protein P47K |
Protein accession | YP_002566483 |
Protein GI | 222480246 |
COG category | [R] General function prediction only |
COG ID | [COG0523] Putative GTPases (G3E family) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.311808 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.553805 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTGACG ACCGGATTCC CGTGACCGTT CTCAGCGGGT ACCTCGGCGC CGGGAAGACG ACGATGGTGA ACCACCTGCT CGCGAACCCC GGCGACAGGC GGATCGCCGT CATCCTGAAC GACATGGGCG AGGTGAACGT CGACGCCGAG CTGGTCGCGC GAGAGAACGA TGAGGAGGGG ATCGTCGACC TCTCGAACGG CTGCATCTGC TGTCGGCTGC AGGACGACCT GCTGAGCGAG GCCGCGACGC TCGCGGAGTC CCGCGAGTTC GACTACCTCC TCGTGGAGTC GTCGGGCATC TCCGAGCCGA TCCCGATCGC GCGGGCGTTC ACGGAGGGGA CCGAGGACTC CGATATCGAC CCGACCGAGC GCTTCCGGCT CGACACGATG GTCACCGTCC TCGACACCTA CGGCTTCTGG AAGGAGTTCG ACGCCGGTGA GAACCTGCCG GGCGACGGGG AGTCGACCGA GGACCGCCCC CTCGCGGACG TGCTCGTCGA GGGGATCGAG TTCTGCGACG TGCTCGTGAT GAACAAGACG GATATGGTTC CCGACGACGT GCTCGATGAG ATCGAGTCGG TCGCCGAGCG GCTCGGCCCG CGGGCGAAGC GGATCCGGAC GAGCTACTCC GAGGTCAACC CTGACGAGGT GCTCGACACC GGACGGTTCG ACTTCGAGAC GGCCACGCGG TCGCCGGGGT GGAAGCGCGA GATTGCGGAA AGCGAGGGCG AACACGGTCA CGAGCCAGCA GCCGGCCAGC ACGAACACGA TCACGCCGAG GGCGCCGCCG CGGCCCACGG CGTCGACTCG TTCGTCTACC GCTCGGCCGA CGCCCTCGAT CCCGAGTCGT TCGCGAACTG GCTCCACGAC TGGGACGGCG CGATTGTGCG GGCGAAGGGT GTCGCCAACG TCGCGGGGAC GGACGAGGTC ATCGGCGTGA GTCAGGCCGG TCCCTCGGTG CAGGCGGGAC CGATCGGCGA GTGGGGGCCC GACGACGACC GCCGCACGCG ACTCGTGTTC ATCGGCAGCG AGATGGACGA AGCGCGGATC CGCGGAGAGC TCGACGAACT GGCGGCGTCG AACCCTGAAT CGGTCGAGAA CGCGGACGCG TTCCCGCTCT GA
|
Protein sequence | MSDDRIPVTV LSGYLGAGKT TMVNHLLANP GDRRIAVILN DMGEVNVDAE LVARENDEEG IVDLSNGCIC CRLQDDLLSE AATLAESREF DYLLVESSGI SEPIPIARAF TEGTEDSDID PTERFRLDTM VTVLDTYGFW KEFDAGENLP GDGESTEDRP LADVLVEGIE FCDVLVMNKT DMVPDDVLDE IESVAERLGP RAKRIRTSYS EVNPDEVLDT GRFDFETATR SPGWKREIAE SEGEHGHEPA AGQHEHDHAE GAAAAHGVDS FVYRSADALD PESFANWLHD WDGAIVRAKG VANVAGTDEV IGVSQAGPSV QAGPIGEWGP DDDRRTRLVF IGSEMDEARI RGELDELAAS NPESVENADA FPL
|
| |