Gene Information |
Locus tag | Huta_1468 |
Symbol | |
ID | 8383747 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | + |
Start bp | 1441610 |
End bp | 1444306 |
Gene Length | 2697 bp |
Protein Length | 898 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644972531 |
Product | Phosphoenolpyruvate carboxylase |
Protein accession | YP_003130377 |
Protein GI | 257052544 |
COG category | [C] Energy production and conversion |
COG ID | [COG2352] Phosphoenolpyruvate carboxylase |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTTTAC ACGGCAGAGA GCTCACCCAG GACGTGCGCG AGCTGGGGGA ACTGCTGGGG ACCATCATCG AGGCACAGGA CTCGACGGAT GCCTACGAGA CCGTCGAGAC GATCCGGAAC AGCGCGATCG CGTACCGTCG TGGGGACGGC GAGTCCCGTG AGCCGATCCA CGACGAACTC GATCGACTCT CCCCGGAGAT GCAGGACGTC GTCGCCAGGG CCTTTACTAC CTACTTCGAA CTCATCAACC TCGCCGAGGA GCGCGAACGG GTCCGCGAGA TCAGGGAAGG CGTCCAGAGC GGCGACCTCT CGGACACCGT CGAGGAAGCC GTCGAGACGC TCGATGCCGA GGACGTCGAT CCGGACACCG TCGAGGAGAT TCTCGAGGAC GTCATGATCG AGCCGACGTT CACGGCCCAT CCGACTGAAG CGCGACGCAA GACGATCAAA GCCAAGCTCT GGTCGGTCGG GAAGATCATC CAGGATCTCG ATCAGATCCG GCTCACCGAT CGGGAGAAGC GACGGATGAA GCGCGAACTC CAGGCAGAAG TGACGAGCCT CTGGCAGACG CCCCAGGTTC GTGATCGCCG CCCCGACGTG ACCGACGAGA CGCTGAACAT CCAGTGGTAT CTCGAGAACA GTCTCTTCGA CATCATCAGT GAGGTCTACG ACGAACTCGA ACACGCCCTG TCGGAGACCT ACGATGGCGA GGTCGACGTC CCGAAACTCT ACGAGTTCCG GTCGTGGGCC GGCAGTGACC GTGACGGCAA TCCCTACGTC ACTCCCGAAG TCACGGAGGA AACCCTCGAG CGCCAGCGCA GCGTCGCGCT CGAACTGTAT CGGGACGATC TCAAGAGCCT CTCCGGCGTG TTGAGCCAGG ACGTCGAGAA CCTGGAGATG AGCGACGCGT TCGAGGAAAG CCACGAGGCA CACAAACAAC GCCTCCCGGG CGTTGCCGAC GGAATAGAGG AGCGATATCC CGACGAGCCC TATCGACAGA AGCTGAAGTT GATGCGTGAG AGTGTCCTCC GGGTTGATGA CGTCCGATCC GGTGCGTACG ACAACGACGA TGAACTCCTT GCCGATTTGG AGATCATCGC CGAGAGCCTC CGGGAGAACG ACGCGGACGA GATCGTTGAT GCGCACGTCA AGCCGCTGAT CAGGAAGGTC GACACCTTCG GGTTCTCGCT GGCGAACCTC GACCTGCGGG ATCACCGCGA GAAACACACG AACGCACTGG TCGAGACCCT CAAACGGGAG GGGATCGACT ACGAGTCGAT GTCCGAGGAC GAGCGCGTCG AGTTCCTGAC GGAAGCGATC CTCCAGGACG ACCCGGTCAT CGACATCGAA GACGACGAAG GGCTCTCCGA GGAGTCAGCC AAGGTCCTGA CGCTGTTCTC CGATGCCGCC GGGTGGCAAC GGGAGTACGG CGTCGACGCC ATCGACACCT ACGCGATCAG CTGGTTCGAG GAGTCCTCAC ACGCCCTGGA GGTCCTCTTC CTGGGCGATC AAGCCGGGAT CGTCGATCTC CCCGGCTACT GCGGGTTCGA CATCGTCCCG CTGTTCGAGA GCGAGTACGC CCTCACGCGC GTCCGGGAGA TGCTGGGGAC GCTCTTCGAG AACGAGGCCT ACAGCCAGGC CCTGGAAGCC CGGAACAACA CCCAGGAGAT CCTGCTGGGA TATTCGGACT CCTCGAAGGA GAACGGCTAT CTGGCCGCCA ACTGGGAACT CTACCGCAAC CAGAAGCGGA TGGCCAACAT CTGTGACGAC TTCGGCGTCA CCCTCCGGCT GTTCCACGGG 
CGGGGTGGTT CGATCTCCCG GGGCGGCGTC CCGATGCACG AGGCGATGCT CGCCTTGCCG AACCAGACCG TCAACGGCCA GATCAAGTTC ACCGAGCAGG GCGAGGCGAT CGCCGAGAAG TACGGCAACC CCGACATCGC CGAGCGCAAC CTCGAACAGA TGCTCAACGC GCAGGTCCTG GCGCGGTACA ACGCCATGGA AAACCCCGTC GAGGACATCC CCGAGGAGTG GATCGAGGCC TTCGAGACGG CGGCCGAGCA CGCCAGCGCG GAGTACCGGG ACCTCCTGGA GACCGACGGG TTCGTCGAGT ACTTCGAGCA GGCGACCCCG ATCACGGTCA TCGAGAACCT CAACATGGGC TCGCGGCCGG CCTCCCGGAC GGAGGATCGG AGTCTCGAAG ACCTCCGGTC GATCCCGTGG GTGTTCTCCT GGACGCAGGC CCGCTGTATC GTCCCCGGCT GGTTCTCCGT CGCCACGGGT ATCCAGGGCT ACCTGGACGA GGGCGGCGAC ATGGAGACGC TCAAGGAGAT GTACGAGGAG TGGCCGTACT TCGGGACGAT CCTCGATCAC GCCGGGATGG CGCTGGCCAA ATCCGACATG GAGATCGCCA CCGAGTACGC CGACCTGGCC GACGACGAGC TTCGAGAGCG GTTCTTCCCC TGGATCCGGA GCGAGTACGA GAACTCCGTC GAACTCATCC AGGAAATCTC CGGGCGCGAA ACGTTGCTCA ACCGATCGTG GATGGAGGAA AACCTCCAGC GCCGGAATCC GTACGTCGAT CCGCTCAACC TGCTGCAAAC CCGGTTGCTC GCCCAGTCAC ACCTCACAGA GACCGAACGC CGGGCGCTCC GGCTGACGGT CCACGGCATC GCCGCTGGCA TGAAGAACAC GGGATGA
|
Protein sequence | MTLHGRELTQ DVRELGELLG TIIEAQDSTD AYETVETIRN SAIAYRRGDG ESREPIHDEL DRLSPEMQDV VARAFTTYFE LINLAEERER VREIREGVQS GDLSDTVEEA VETLDAEDVD PDTVEEILED VMIEPTFTAH PTEARRKTIK AKLWSVGKII QDLDQIRLTD REKRRMKREL QAEVTSLWQT PQVRDRRPDV TDETLNIQWY LENSLFDIIS EVYDELEHAL SETYDGEVDV PKLYEFRSWA GSDRDGNPYV TPEVTEETLE RQRSVALELY RDDLKSLSGV LSQDVENLEM SDAFEESHEA HKQRLPGVAD GIEERYPDEP YRQKLKLMRE SVLRVDDVRS GAYDNDDELL ADLEIIAESL RENDADEIVD AHVKPLIRKV DTFGFSLANL DLRDHREKHT NALVETLKRE GIDYESMSED ERVEFLTEAI LQDDPVIDIE DDEGLSEESA KVLTLFSDAA GWQREYGVDA IDTYAISWFE ESSHALEVLF LGDQAGIVDL PGYCGFDIVP LFESEYALTR VREMLGTLFE NEAYSQALEA RNNTQEILLG YSDSSKENGY LAANWELYRN QKRMANICDD FGVTLRLFHG RGGSISRGGV PMHEAMLALP NQTVNGQIKF TEQGEAIAEK YGNPDIAERN LEQMLNAQVL ARYNAMENPV EDIPEEWIEA FETAAEHASA EYRDLLETDG FVEYFEQATP ITVIENLNMG SRPASRTEDR SLEDLRSIPW VFSWTQARCI VPGWFSVATG IQGYLDEGGD METLKEMYEE WPYFGTILDH AGMALAKSDM EIATEYADLA DDELRERFFP WIRSEYENSV ELIQEISGRE TLLNRSWMEE NLQRRNPYVD PLNLLQTRLL AQSHLTETER RALRLTVHGI AAGMKNTG
|
| |
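The coordinate and length fields in this record are mutually consistent, which can be checked with a small sketch. It assumes the start and end positions are 1-based and inclusive, and that the CDS includes its stop codon (the gene sequence above ends in TGA). The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


# Values copied from the record for Huta_1468.
start_bp, end_bp = 1441610, 1444306
length = gene_length(start_bp, end_bp)
print(length, protein_length(length))  # 2697 898
```

Both values match the Gene Length (2697 bp) and Protein Length (898 aa) fields listed above.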