Gene Information |
Locus tag | Htur_1729 |
Symbol | |
ID | 8742323 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013743 |
Strand | - |
Start bp | 1798412 |
End bp | 1800244 |
Gene Length | 1833 bp |
Protein Length | 610 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 646512307 |
Product | Carbamoyl-phosphate synthase L chain ATP-binding protein |
Protein accession | YP_003403287 |
Protein GI | 284165008 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG4770] Acetyl/propionyl-CoA carboxylase, alpha subunit |
TIGRFAM ID | [TIGR00514] acetyl-CoA carboxylase, biotin carboxylase subunit |
|
|
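The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS is unspliced and ends in a single stop codon (the record lists "Is gene spliced | No"). The function names `gene_length` and `protein_length` are illustrative and not part of any database API.

```python
# Coordinates copied from the record above.
# Assumption: positions are 1-based and inclusive on replicon NC_013743.
START_BP = 1798412
END_BP = 1800244


def gene_length(start: int, end: int) -> int:
    """Length in bp of an inclusive, 1-based coordinate range."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(f"Gene length: {length_bp} bp, protein length: {length_aa} aa")
```

Under these assumptions the computed values agree with the "Gene Length" (1833 bp) and "Protein Length" (610 aa) fields of the record.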
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTTCGGA AGGTTCTCGT GGCGAACCGC GGGGAGATCG CGGTTCGAGT GATGCGCGCG TGCGAGGAGT TGAACATCGG GACCGTCGCC GTCTACTCCG AGGCCGACAA GGACTCGGGA CACGTCCGCT ACGCCGACGA GGCGTACAAC GTGGGGCCGG CCCGCGCGGC CGACTCGTAT CTCGATCACG AGGCCGTCAT CGAGGCCGCC CGGAAAGCCG ACGCCGACGC CATCCACCCC GGCTACGGCT TCCTCGCGGA GAACGCCGAG TTCGCGGGCA AGGTCGAGGA GGCGGAGGGC ATCACCTGGA TCGGTCCCTC GAGTTCGGCG ATGGAGTCGC TGGGCGAGAA GACCAAGGCC CGCACGATCA TGCGCGAGGC CGACGTGCCG ATCGTCCCCG GGACCACGGA CCCCGTCACC GATCCGGAAG CGGTCAAAGA GTTCGGCGAA GAACACGGCT ACCCGATCGC CATCAAGGCC GAGGGCGGCG GCGGCGGCCG CGGGATGAAG GTCGTCTGGG AGGAAAGCGA AGTCGAGGAC CAACTCGAGA GCGCCCAGCG CGAGGGCGAG GCCTACTTCG ATAACGATTC GGTCTACCTC GAGCGCTACC TCGAGCAACC CCGTCACATC GAGGTCCAGA TTCTGGCCGA CGAACACGGT AACGTCCGCC ATCTCGGCGA GCGGGACTGT TCGCTCCAGC GCCGTCACCA GAAGGTCATC GAGGAGGGGC CGTCGGCTGC CCTCTCGGAC GAACTCCGCG AGAAGATCGG CGAGGCCGCC CGCCGCGGCG TCGCTGCGGC CGACTACACC AACGCCGGCA CCGTCGAGTT CCTCGTCGAA GAGGAGGACC GCGAGGCGGG CGAACTGCTC GGTCCCGACG CGAACTTCTA CTTCCTCGAG GTCAACACGC GGATCCAGGT CGAGCACACG GTCACCGAGG AGATCACCGG GATCGACATC GTCAAGCGCC AGATTCAGGT CGCCGCCGGC GAGGAGATCG ACTTCGCACA GGACGACGTC GACATCGACG GCCACGCGAT GGAGTTCCGG ATCAACGCCG AGAACGCGGC CGAGGACTTC GCGCCCGCGA CTGGCGGAAC CCTCGAGACC TACGACCCGC CGGGCGGGAT CGGCGTCCGA CTCGACGACG CCCTGCGGCA GGGCGACGAC CTCGTCACCG ACTACGACTC GATGATCGCG AAACTGGTCG TCTGGGGCGA GGACCGCGAC GAGTGTATCG AGCGCTCGCT GCGCGCGCTG CGAGAGTACG AGATCGAGGG GATCCCGACG ATCATCCCGT TCCACCGGCT GATGCTCACC GACGAGGAGT TCGTCGCGAG CACGCACACG ACGAAGTACC TAGACGAGGA ACTCGACGAG ACCCGCATCG AGGAGGCCCA GGAACAGTGG GGCGGCGACA CCGGCGACGG AGCCGGCGAC GACGAGGAGT CCGTCGAACG CGAGTTCACC GTCGAGGTCA ACGGGAAGCG CTTCGAGGTC GAACTCGAGG AACACGGCGC GCCAGCCATC CCGGCCGGCG ACGTCGACGT CGGCGGCGGA CAGGCCGAAC GGCCACAACC CGGCGGCGGC TCCAGCGGCG GCGACGAACT CGAGGGCAGC GGCGAGACCG TCGACGCCGA GATGCAGGGC ACCATCCTCG ACGTCACGGT CGAGGTCGGC GACGAGGTCG CCGCCGGCGA CGTGCTGGTC GTCCTCGAGG CGATGAAGAT GGAAAACGAC ATCGTCGCCT CCAAGGGCGG CACTGTCACC GAGATCGCCG TCGAGGAAGA CCAGAGCGTC 
GATATGGGCG ATACGCTGGT CGTCCTCGAG TAA
|
Protein sequence | MFRKVLVANR GEIAVRVMRA CEELNIGTVA VYSEADKDSG HVRYADEAYN VGPARAADSY LDHEAVIEAA RKADADAIHP GYGFLAENAE FAGKVEEAEG ITWIGPSSSA MESLGEKTKA RTIMREADVP IVPGTTDPVT DPEAVKEFGE EHGYPIAIKA EGGGGGRGMK VVWEESEVED QLESAQREGE AYFDNDSVYL ERYLEQPRHI EVQILADEHG NVRHLGERDC SLQRRHQKVI EEGPSAALSD ELREKIGEAA RRGVAAADYT NAGTVEFLVE EEDREAGELL GPDANFYFLE VNTRIQVEHT VTEEITGIDI VKRQIQVAAG EEIDFAQDDV DIDGHAMEFR INAENAAEDF APATGGTLET YDPPGGIGVR LDDALRQGDD LVTDYDSMIA KLVVWGEDRD ECIERSLRAL REYEIEGIPT IIPFHRLMLT DEEFVASTHT TKYLDEELDE TRIEEAQEQW GGDTGDGAGD DEESVEREFT VEVNGKRFEV ELEEHGAPAI PAGDVDVGGG QAERPQPGGG SSGGDELEGS GETVDAEMQG TILDVTVEVG DEVAAGDVLV VLEAMKMEND IVASKGGTVT EIAVEEDQSV DMGDTLVVLE
|
| |
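The sequences above are printed in space-separated blocks of ten characters, so the spacing has to be stripped before any downstream use. The following is a minimal sketch of that clean-up, plus a GC-fraction helper of the kind that could be used to recompute the "GC content" field. The helper names are hypothetical, and the sample string is only the first two blocks of the gene sequence, so its GC fraction is not the whole-gene figure.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (block spacing, line wraps) and upper-case."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First two ten-base blocks of the gene sequence in the record above.
sample = "ATGTTTCGGA AGGTTCTCGT"
cleaned = clean_sequence(sample)
print(cleaned, len(cleaned), gc_fraction(cleaned))
```

Passing the full "Gene sequence" field through `clean_sequence` and then `gc_fraction` would give a value to compare with the 67% listed in the record.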