Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hhal_2361 |
Symbol | |
ID | 4709302 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | + |
Start bp | 2587525 |
End bp | 2591127 |
Gene Length | 3603 bp |
Protein Length | 1200 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 639856836 |
Product | urea amidolyase related protein |
Protein accession | YP_001003926 |
Protein GI | 121999139 |
COG category | [C] Energy production and conversion [E] Amino acid transport and metabolism [I] Lipid transport and metabolism |
COG ID | [COG1038] Pyruvate carboxylase [COG1984] Allophanate hydrolase subunit 2 [COG2049] Allophanate hydrolase subunit 1 [COG4770] Acetyl/propionyl-CoA carboxylase, alpha subunit |
TIGRFAM ID | [TIGR00724] biotin-dependent carboxylase uncharacterized domain [TIGR02712] urea carboxylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTCAGCA AGGTCCTCAT CGCCAACCGT GGTGCCATCG CCTGTCGGAT CATCCGCACC CTGCGCCGGC TGGGCGTGGC CTCGGTGGCC GTCTACTCCG AGGCGGACCG CCACTCCCTC CACGTCCGCC AGGCGGACCA GGCGGTCTGC ATCGGCCCGC CGAGCGCCGC CGAGTCGTAC CTGAACGACG GCGCCATCCT CGCGGCCGCG CAGCAGACCG GTGCCGAGGC CATCCACCCC GGCTACGGCT TCCTGTCGGA GAACGACGCC TTCGCCGAGG CCTGCGAGGC GGCGGGCATC GCCTTCATCG GACCGACGCC GCAGCAGATG CGCGACTTCG GCCTCAAGCA CACCGCCCGC GCCCTGGCCG AGGCCAGTGA CGTGCCCATG CTGCCGGGGA CCGGACTACT CGACAGCATG GACGAGGCCG TGGCGGAGGC CGCCCGGATC GGCTATCCGG TGATGCTCAA GAGCACCGCC GGCGGCGGCG GGATCGGCAT GCAGCTCTGC CACGACGCAG CCGCCCTGCG CGAGGCCTAC GACTCGGTGC GCCGGCTTTC GGCGAACAAC TTCTCCAACG ACGGGCTGTT CCTGGAGAAG TACGTCGAGC ACGGGCGGCA CATCGAGGTG CAGCTCTTCG GCGATGGATG CGGCACGGTC ATCGCCCTCG GCGAGCGCGA CTGCTCGCTG CAGCGGCGCA ACCAGAAGGT GGTCGAAGAG ACCCCCGCCC CCGGCCTCGA CGCGGACACC CGCCAGGCCC TGCTGGACGC CGCCGAGCGG TTGGGCCGGC AGGTGGCGTA CCGCAGCGCC GGGACCGTGG AGTACATCCT GGATGCCGAT ACCGGGGCCT TCTACTTCCT GGAGATGAAC ACCCGGCTGC AGGTCGAGCA CGGAGTCACC GAGGCGGTGA CCGGCATCGA CCTGGTCGGG TGGATGGTGG AAGCCGCCGC GGGCACCCTC ACCGACCTCG CAGCCCGCCG CCCCGGGCAC CGCGGCCACG CCATCCAGGC CCGCCTCTAC GCCGAGGACC CGGGCAAGGG CTTCCAGCCG GCCACCGGCC TGCTTACCGA GGTCCGCTTC CCGGACGCGG CGCGCATCGA GACCTGGGTG GAGCGCGGCA GCGAGATCTC GCCGCACTAC GACCCGATGA TCGCTAAGCT GATCGTCCAC GGGACGGATC GCGCCGATGC GCTGCAGCGC CTGCGCCAGG CGCTGGGCGA GACCGCGCTG CACGGCGTCG AGACCAACCT GCCCTATCTG CGCGCGATCG CCGAGGATGA AACCTTCGCC GCCGGCCGGG CCACCACCCG CTACCTGGAT CGCTTCACCT ACCGCCCGGC AAGCATCGAT GTCCTCCAGC CCGGCACCCA GACCACGGTC CAGGACTGGC CGGGGCGGGT GGGTTATTGG GAGGTGGGCG TCCCGCCCTC GGGCCCCATG GACGCCCTGG CCTTTCGTCT GGGTAACCGC ATCGTCGGCA ACCCGGAAGG GGCCGCCGGT CTGGAGATGA CCGCCAGCGG CGCCACCCTG CAGTTCAACA CCGCCACCAC CATCGCCCTG ACCGGCGCGA CCATGCCCGC CGAGCTGGAC GGCAGCCCTG TCCCACCCTG GCAGGCGGTG GCCGTGCCCG CCGGGGCCCG GCTCCGGCTC GGCAGCGCCG CAGGCCCGGG GGTGCGCACC TACCTGCTGC TCCGCGGCGG GGTCGACGTC CCCTTGCACC TGGGCAGCCG CGCCACCTTC ACGCTCGGGC GGATGGGCGG CCACGGCGGC CGCGCCCTGC AGACCGGTGA TGTCCTGCAC CTCGGCCCGG AACCGGACCG GCCGCTCACC GCCCTGCCCG CCAACGGCGT CCCTGCCTAC GGCGAAGAGT GGACCCTCCG CGCCGTCTAC GGCCCCCACG GCGCCCCGGA CTTCTTCACC GAGGCGGATA TCGACACGTT CTTCACCACC GCCTGGCAGG TGCACTACAA CTCCAGCCGC ACCGGCGTGC GGCTGATTGG CCCCAAACCG GAGTGGGCAC GGGCGGACGG CGGTGAGGCG GGCCTGCATC CATCGAACAT CCACGACAAC GCCTACGCCA TTGGCACCGT GGACTTCACC GGCGACATGC CGGTGATCCT CGGCCCCGAC GGCCCGAGCC TGGGCGGCTT CGTCTGTCCG GCCACCATCG TCACCGCCGA CCTGTGGAAG ATCGGCCAGC TGCGCCCCGG CGATCGGGTC CGCTTCCAGG CGGTTGATGC GGCCACCGGA CAGCGTCTGG CCGAGGCGCA GGATGCGGCC ATCGCCCAGG GCGCCCCGGC CGACCCGCCG CTACCCGACT CCGTGTCCGG GCGGGTGGCG CCGACCCTGC GCCCCCAGGC CACCGCGGCC GAAGCGGTGA CCGTCACCTA CCGCAGCGCC GGGGAGCGCT ACCTGCTGGT GGAGTACGGC CCCATCGTGC TGGACCTGAA CCTGCGCTTC CGTGTCCAGG CCCTGCTGGA GTGGCTGAGC GAGCAGGCCA TCCCCGGCAT CCTGGAGATG ACTCCCGGGG TGCGCAGCCT GCAGATCCAC TACGAGCCAC GCCGCCTGCC CCAGGAGCGG CTGCTGACGA TCCTCGAGGA GGCCGAGGGG GAGCTGCGCG ACCTCGCCGA GGCCGAGATG CCCTCACGTA TCGTCCACCT CCCGCTGGCC TGGGACGACT CCCAGACCCG GCTGGCCACG GAGAAGTACA TGCAGTCGGT GCGCGCCGAT GCCCCCTGGT GCCCCCATAA CATCGAGTTC ATTCGGCGCA TCAACGGCCT GCCGGACGAG GCCGCGGTCA AGCGGACCGT CTACGACGCC TCGTACCTGG TCATGGGCCT CGGCGACGTC TACCTCTCGG CCCCCCTGGC CACCCCGGTG GATCCGCGCC ACCGGCTGGT GACCACCAAG TACAACCCGG CCCGCACCTG GACACCGGAG AACGCCGTGG GGATCGGCGG CTCGTACCTG TGCATCTACG GCATGGAGGG GCCCGGGGGC TACCAGTTCG TGGGCCGCAC ACTGCAGATC TGGAATCGCT ACCACGTCAC CCCGGCCTTC GAGAAACCCT GGCTGCTGCG CTTCTTCGAT CAGATCCGCT TCTATGAGGT CAGCGAGGCG GAACTGCTGG AGATGCGCGA CGCCTTCCCC CGCGGCGGAC TGGCGCTGGA GATCGAGGAG ACCACCTTCT CGCTGCAGCG CTATAACCGC TTCCTGCGCG AGAACCAGGC CTCCATCGAG GCGTTCAAGG CCCACCAGCA GCAGGCCTTC GAGGCCGAGC GGCAGCGCTG GATCGCCAAC GGCCAGGCCG ACTACGAAGC CGAGAGCGAG CCGCCCCCGG CAGCGGGTGA AGGCGTCGAG CTCGGCGCCG ACGAGCAGGC CGTCGCGGCC CATGTGCACG CCAACCTCTG GTCGCTGCAG GTGGCGGAGG GCGAGACCGT GGAGGCCGGC CAGACCCTGC TGGTCCTGGA GTCGATGAAG ATGGAGATCC CCCTCTGCGC CGATCAGGCG GGCACCGTGC GCCGGCTGCT GTGCCGCGAA GGGGCCCAGG TGGCCCCGGG ACAGACCCTG CTGACCCTGA CCAGCGACGG GAGTACACCA TGA
|
Protein sequence | MFSKVLIANR GAIACRIIRT LRRLGVASVA VYSEADRHSL HVRQADQAVC IGPPSAAESY LNDGAILAAA QQTGAEAIHP GYGFLSENDA FAEACEAAGI AFIGPTPQQM RDFGLKHTAR ALAEASDVPM LPGTGLLDSM DEAVAEAARI GYPVMLKSTA GGGGIGMQLC HDAAALREAY DSVRRLSANN FSNDGLFLEK YVEHGRHIEV QLFGDGCGTV IALGERDCSL QRRNQKVVEE TPAPGLDADT RQALLDAAER LGRQVAYRSA GTVEYILDAD TGAFYFLEMN TRLQVEHGVT EAVTGIDLVG WMVEAAAGTL TDLAARRPGH RGHAIQARLY AEDPGKGFQP ATGLLTEVRF PDAARIETWV ERGSEISPHY DPMIAKLIVH GTDRADALQR LRQALGETAL HGVETNLPYL RAIAEDETFA AGRATTRYLD RFTYRPASID VLQPGTQTTV QDWPGRVGYW EVGVPPSGPM DALAFRLGNR IVGNPEGAAG LEMTASGATL QFNTATTIAL TGATMPAELD GSPVPPWQAV AVPAGARLRL GSAAGPGVRT YLLLRGGVDV PLHLGSRATF TLGRMGGHGG RALQTGDVLH LGPEPDRPLT ALPANGVPAY GEEWTLRAVY GPHGAPDFFT EADIDTFFTT AWQVHYNSSR TGVRLIGPKP EWARADGGEA GLHPSNIHDN AYAIGTVDFT GDMPVILGPD GPSLGGFVCP ATIVTADLWK IGQLRPGDRV RFQAVDAATG QRLAEAQDAA IAQGAPADPP LPDSVSGRVA PTLRPQATAA EAVTVTYRSA GERYLLVEYG PIVLDLNLRF RVQALLEWLS EQAIPGILEM TPGVRSLQIH YEPRRLPQER LLTILEEAEG ELRDLAEAEM PSRIVHLPLA WDDSQTRLAT EKYMQSVRAD APWCPHNIEF IRRINGLPDE AAVKRTVYDA SYLVMGLGDV YLSAPLATPV DPRHRLVTTK YNPARTWTPE NAVGIGGSYL CIYGMEGPGG YQFVGRTLQI WNRYHVTPAF EKPWLLRFFD QIRFYEVSEA ELLEMRDAFP RGGLALEIEE TTFSLQRYNR FLRENQASIE AFKAHQQQAF EAERQRWIAN GQADYEAESE PPPAAGEGVE LGADEQAVAA HVHANLWSLQ VAEGETVEAG QTLLVLESMK MEIPLCADQA GTVRRLLCRE GAQVAPGQTL LTLTSDGSTP
|
| |