Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Huta_2278 |
Symbol | |
ID | 8384575 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | + |
Start bp | 2326430 |
End bp | 2328526 |
Gene Length | 2097 bp |
Protein Length | 698 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 644973350 |
Product | CoA-binding domain protein |
Protein accession | YP_003131178 |
Protein GI | 257053345 |
COG category | [C] Energy production and conversion |
COG ID | [COG1042] Acyl-CoA synthetase (NDP forming) |
TIGRFAM ID | [TIGR02717] acetyl coenzyme A synthetase (ADP forming), alpha domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.244685 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGCCAGT TAGATACGTT ATTTGCGCCG GAACGGGTAG CTGTCGTCGG AGCGACCGAG AGTGAGGGCT CCGTCGGGCG CGCAGTCATG GAGAATCTCC TTGACGGCTA TGAGGGGGAC GTCGTGGCGG TCAATCCGAG CTCGGAGACG GTGTTCGGTC TCGACTGTCA TGACAGTATC GCCGACGTCG GGGGCGTCGA TCTGGCGATC GTCGTCGTGC CGCCACAGAT CGTCAACGGG GTCCTCGAAG AGGCTGGCGA GGCCGGGGTC CGCGACGTTG CCGTGATCAC TGCCGGGTTC GGGGAGGCCG GCAGCGAGGG GGCCGCCCGG GAACGGGACA TGAAAGCGAT CGCCGAGGAC CACGATCTCA ATCTCGTCGG TCCGAACTGT CTGGGCGTCC TCTCGACGCG CCGGAACATG AACGCGACGT TCAGCCCGAA ATCGGCCATC GAGGGCAACA TCTCCTTCAT GAGCCAGTCG GGGGCGTTCA TCACGGCCGT CCTCGACTGG GCGAGCGATC ACAACGTCGG CTTCAACGAC GTCGTCTCAC TGGGTAACAA GGCCGTCCTC GACGAGGGGG ACTTCATCGA CTACTGGGGC GAGGACCCCG AGACGGACGT CATCCTCGGC TACCTCGAAG ACATCGAGAA CGGCCGGGAT TTCATCGACA CGGCACGCGA GGTCACCAAG GACACGCCGA TCGTCGCCGT CAAATCCGGA CGGACGGAAG CCGGCGCGTC GGCCGCGGCC TCTCACACCG GCGCGATCGC CGGCAGCGAA CGGGCCTACG AGGCCGGTCT GGAGCAGGCG GGGGTACTCC GGGCGACCAG CGTCCAGGAA CTGTTCGACT ACGCGACGAT CCTCGAAGAT CAGCCGATGC CCGAGAACGA CCAGATCGCG ATCGTCACGA ACGCCGGCGG CCCCGGCGTG ATGTCGACTG ACGCGATCGG TGACTCCGGG CTCGAGATGG CGACACTGAC CGACGAAACC CTCGAGGCAC TCGAAGCCGA CATGCCGGAC GGAGCCAACA TCTACAATCC CGTCGACGTC CTCGGGGACG CACCCAGCGA GCGCTACGAG CAAGCGCTCG ACGTGGTGTT GCAGGACCCC AACGTCGGCT CGGTGGTCGT CGTGGCCTGC CCCACGGCCG TGCTCTCCTT CGAGAAGCTG GCCGAAACGG TCACCGACAA GTTCGAGGAA TACGGCGTTC CCATGGCAGC GAGTCTCATG GGCGGGGACT CGGCCCAGCA AGCCAACGAG ATCCTCGGCG AGGCTGGCAT CCCGTCGTAC TTCGATCCCG CACGCGGCGT CAACGGTCTC GACGCGCTCC GGGAGTACGC CGAGATTCGC GAACACGAGT ACGCCGAGCC CCGGGACTTC GACGTCGATC GCGAGCGCGC TCGCGAGATC CTCGAACGGA CGAAGGAACG TGACACTAAC AAGCTCGGGG TCGAGGCCAT GGAACTGCTC GAAGCCTACG GGATCCCGAC CCCGCAGGGA GCCATCGTCT CTGACAAGAA CGCGGCGGTC GAAGCCGCCA AAGATATCCC GGGCAACGTC GTGATGAAGA TCGTCAGTCC GGACATCCTC CACAAATCGG ACATCGGTGG CGTCGAGGTC AGCGTCCCCG ACGACGAGGT CGCTAGCACC TACGACGATC TGATCGCTCG TGCGCGCAAC TACCAGCCCG ACGCGACGAT CCTCGGCGTT CAGGTCCAGG AGATGGCCGA CCTCGACGCG GGGACCGAGA CCATCGTCGG GATCAACCGC GACCCGCAGT TCGGACCGCT GGTGATGTTC GGGCTGGGAG GCATCTTCGT GGAAGTGCTC GAAGACGCCA CCTTCCGGGT CGCCCCCGTC AGCGAACCCG AGGCCGAGGA GATGATCGAC GAGATCGATT CCGCGCCGCT ACTCCGTGGG GCGCGTGGTC GCGAGCCGGT GGACGAAGCC GGCGTCGTCG AGACGATCCA GCGCATCTCG AAGCTCGTTA CTGACTTCCC AGCGATCCTC GAACTCGACA TCAACCCGCT CGTCGCGACG CCGGATGGCG TCGTAGCCGT CGACATCCGG GCGACCGTCG ACCAGGAGGA ACTTTAA
|
Protein sequence | MGQLDTLFAP ERVAVVGATE SEGSVGRAVM ENLLDGYEGD VVAVNPSSET VFGLDCHDSI ADVGGVDLAI VVVPPQIVNG VLEEAGEAGV RDVAVITAGF GEAGSEGAAR ERDMKAIAED HDLNLVGPNC LGVLSTRRNM NATFSPKSAI EGNISFMSQS GAFITAVLDW ASDHNVGFND VVSLGNKAVL DEGDFIDYWG EDPETDVILG YLEDIENGRD FIDTAREVTK DTPIVAVKSG RTEAGASAAA SHTGAIAGSE RAYEAGLEQA GVLRATSVQE LFDYATILED QPMPENDQIA IVTNAGGPGV MSTDAIGDSG LEMATLTDET LEALEADMPD GANIYNPVDV LGDAPSERYE QALDVVLQDP NVGSVVVVAC PTAVLSFEKL AETVTDKFEE YGVPMAASLM GGDSAQQANE ILGEAGIPSY FDPARGVNGL DALREYAEIR EHEYAEPRDF DVDRERAREI LERTKERDTN KLGVEAMELL EAYGIPTPQG AIVSDKNAAV EAAKDIPGNV VMKIVSPDIL HKSDIGGVEV SVPDDEVAST YDDLIARARN YQPDATILGV QVQEMADLDA GTETIVGINR DPQFGPLVMF GLGGIFVEVL EDATFRVAPV SEPEAEEMID EIDSAPLLRG ARGREPVDEA GVVETIQRIS KLVTDFPAIL ELDINPLVAT PDGVVAVDIR ATVDQEEL
|
| |