Gene Hhal_2361 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_2361 
Symbol 
ID4709302 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp2587525 
End bp2591127 
Gene Length3603 bp 
Protein Length1200 aa 
Translation table11 
GC content71% 
IMG OID639856836 
Producturea amidolyase related protein 
Protein accessionYP_001003926 
Protein GI121999139 
COG category[C] Energy production and conversion
[E] Amino acid transport and metabolism
[I] Lipid transport and metabolism 
COG ID[COG1038] Pyruvate carboxylase
[COG1984] Allophanate hydrolase subunit 2
[COG2049] Allophanate hydrolase subunit 1
[COG4770] Acetyl/propionyl-CoA carboxylase, alpha subunit 
TIGRFAM ID[TIGR00724] biotin-dependent carboxylase uncharacterized domain
[TIGR02712] urea carboxylase 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTCAGCA AGGTCCTCAT CGCCAACCGT GGTGCCATCG CCTGTCGGAT CATCCGCACC 
CTGCGCCGGC TGGGCGTGGC CTCGGTGGCC GTCTACTCCG AGGCGGACCG CCACTCCCTC
CACGTCCGCC AGGCGGACCA GGCGGTCTGC ATCGGCCCGC CGAGCGCCGC CGAGTCGTAC
CTGAACGACG GCGCCATCCT CGCGGCCGCG CAGCAGACCG GTGCCGAGGC CATCCACCCC
GGCTACGGCT TCCTGTCGGA GAACGACGCC TTCGCCGAGG CCTGCGAGGC GGCGGGCATC
GCCTTCATCG GACCGACGCC GCAGCAGATG CGCGACTTCG GCCTCAAGCA CACCGCCCGC
GCCCTGGCCG AGGCCAGTGA CGTGCCCATG CTGCCGGGGA CCGGACTACT CGACAGCATG
GACGAGGCCG TGGCGGAGGC CGCCCGGATC GGCTATCCGG TGATGCTCAA GAGCACCGCC
GGCGGCGGCG GGATCGGCAT GCAGCTCTGC CACGACGCAG CCGCCCTGCG CGAGGCCTAC
GACTCGGTGC GCCGGCTTTC GGCGAACAAC TTCTCCAACG ACGGGCTGTT CCTGGAGAAG
TACGTCGAGC ACGGGCGGCA CATCGAGGTG CAGCTCTTCG GCGATGGATG CGGCACGGTC
ATCGCCCTCG GCGAGCGCGA CTGCTCGCTG CAGCGGCGCA ACCAGAAGGT GGTCGAAGAG
ACCCCCGCCC CCGGCCTCGA CGCGGACACC CGCCAGGCCC TGCTGGACGC CGCCGAGCGG
TTGGGCCGGC AGGTGGCGTA CCGCAGCGCC GGGACCGTGG AGTACATCCT GGATGCCGAT
ACCGGGGCCT TCTACTTCCT GGAGATGAAC ACCCGGCTGC AGGTCGAGCA CGGAGTCACC
GAGGCGGTGA CCGGCATCGA CCTGGTCGGG TGGATGGTGG AAGCCGCCGC GGGCACCCTC
ACCGACCTCG CAGCCCGCCG CCCCGGGCAC CGCGGCCACG CCATCCAGGC CCGCCTCTAC
GCCGAGGACC CGGGCAAGGG CTTCCAGCCG GCCACCGGCC TGCTTACCGA GGTCCGCTTC
CCGGACGCGG CGCGCATCGA GACCTGGGTG GAGCGCGGCA GCGAGATCTC GCCGCACTAC
GACCCGATGA TCGCTAAGCT GATCGTCCAC GGGACGGATC GCGCCGATGC GCTGCAGCGC
CTGCGCCAGG CGCTGGGCGA GACCGCGCTG CACGGCGTCG AGACCAACCT GCCCTATCTG
CGCGCGATCG CCGAGGATGA AACCTTCGCC GCCGGCCGGG CCACCACCCG CTACCTGGAT
CGCTTCACCT ACCGCCCGGC AAGCATCGAT GTCCTCCAGC CCGGCACCCA GACCACGGTC
CAGGACTGGC CGGGGCGGGT GGGTTATTGG GAGGTGGGCG TCCCGCCCTC GGGCCCCATG
GACGCCCTGG CCTTTCGTCT GGGTAACCGC ATCGTCGGCA ACCCGGAAGG GGCCGCCGGT
CTGGAGATGA CCGCCAGCGG CGCCACCCTG CAGTTCAACA CCGCCACCAC CATCGCCCTG
ACCGGCGCGA CCATGCCCGC CGAGCTGGAC GGCAGCCCTG TCCCACCCTG GCAGGCGGTG
GCCGTGCCCG CCGGGGCCCG GCTCCGGCTC GGCAGCGCCG CAGGCCCGGG GGTGCGCACC
TACCTGCTGC TCCGCGGCGG GGTCGACGTC CCCTTGCACC TGGGCAGCCG CGCCACCTTC
ACGCTCGGGC GGATGGGCGG CCACGGCGGC CGCGCCCTGC AGACCGGTGA TGTCCTGCAC
CTCGGCCCGG AACCGGACCG GCCGCTCACC GCCCTGCCCG CCAACGGCGT CCCTGCCTAC
GGCGAAGAGT GGACCCTCCG CGCCGTCTAC GGCCCCCACG GCGCCCCGGA CTTCTTCACC
GAGGCGGATA TCGACACGTT CTTCACCACC GCCTGGCAGG TGCACTACAA CTCCAGCCGC
ACCGGCGTGC GGCTGATTGG CCCCAAACCG GAGTGGGCAC GGGCGGACGG CGGTGAGGCG
GGCCTGCATC CATCGAACAT CCACGACAAC GCCTACGCCA TTGGCACCGT GGACTTCACC
GGCGACATGC CGGTGATCCT CGGCCCCGAC GGCCCGAGCC TGGGCGGCTT CGTCTGTCCG
GCCACCATCG TCACCGCCGA CCTGTGGAAG ATCGGCCAGC TGCGCCCCGG CGATCGGGTC
CGCTTCCAGG CGGTTGATGC GGCCACCGGA CAGCGTCTGG CCGAGGCGCA GGATGCGGCC
ATCGCCCAGG GCGCCCCGGC CGACCCGCCG CTACCCGACT CCGTGTCCGG GCGGGTGGCG
CCGACCCTGC GCCCCCAGGC CACCGCGGCC GAAGCGGTGA CCGTCACCTA CCGCAGCGCC
GGGGAGCGCT ACCTGCTGGT GGAGTACGGC CCCATCGTGC TGGACCTGAA CCTGCGCTTC
CGTGTCCAGG CCCTGCTGGA GTGGCTGAGC GAGCAGGCCA TCCCCGGCAT CCTGGAGATG
ACTCCCGGGG TGCGCAGCCT GCAGATCCAC TACGAGCCAC GCCGCCTGCC CCAGGAGCGG
CTGCTGACGA TCCTCGAGGA GGCCGAGGGG GAGCTGCGCG ACCTCGCCGA GGCCGAGATG
CCCTCACGTA TCGTCCACCT CCCGCTGGCC TGGGACGACT CCCAGACCCG GCTGGCCACG
GAGAAGTACA TGCAGTCGGT GCGCGCCGAT GCCCCCTGGT GCCCCCATAA CATCGAGTTC
ATTCGGCGCA TCAACGGCCT GCCGGACGAG GCCGCGGTCA AGCGGACCGT CTACGACGCC
TCGTACCTGG TCATGGGCCT CGGCGACGTC TACCTCTCGG CCCCCCTGGC CACCCCGGTG
GATCCGCGCC ACCGGCTGGT GACCACCAAG TACAACCCGG CCCGCACCTG GACACCGGAG
AACGCCGTGG GGATCGGCGG CTCGTACCTG TGCATCTACG GCATGGAGGG GCCCGGGGGC
TACCAGTTCG TGGGCCGCAC ACTGCAGATC TGGAATCGCT ACCACGTCAC CCCGGCCTTC
GAGAAACCCT GGCTGCTGCG CTTCTTCGAT CAGATCCGCT TCTATGAGGT CAGCGAGGCG
GAACTGCTGG AGATGCGCGA CGCCTTCCCC CGCGGCGGAC TGGCGCTGGA GATCGAGGAG
ACCACCTTCT CGCTGCAGCG CTATAACCGC TTCCTGCGCG AGAACCAGGC CTCCATCGAG
GCGTTCAAGG CCCACCAGCA GCAGGCCTTC GAGGCCGAGC GGCAGCGCTG GATCGCCAAC
GGCCAGGCCG ACTACGAAGC CGAGAGCGAG CCGCCCCCGG CAGCGGGTGA AGGCGTCGAG
CTCGGCGCCG ACGAGCAGGC CGTCGCGGCC CATGTGCACG CCAACCTCTG GTCGCTGCAG
GTGGCGGAGG GCGAGACCGT GGAGGCCGGC CAGACCCTGC TGGTCCTGGA GTCGATGAAG
ATGGAGATCC CCCTCTGCGC CGATCAGGCG GGCACCGTGC GCCGGCTGCT GTGCCGCGAA
GGGGCCCAGG TGGCCCCGGG ACAGACCCTG CTGACCCTGA CCAGCGACGG GAGTACACCA
TGA
 
Protein sequence
MFSKVLIANR GAIACRIIRT LRRLGVASVA VYSEADRHSL HVRQADQAVC IGPPSAAESY 
LNDGAILAAA QQTGAEAIHP GYGFLSENDA FAEACEAAGI AFIGPTPQQM RDFGLKHTAR
ALAEASDVPM LPGTGLLDSM DEAVAEAARI GYPVMLKSTA GGGGIGMQLC HDAAALREAY
DSVRRLSANN FSNDGLFLEK YVEHGRHIEV QLFGDGCGTV IALGERDCSL QRRNQKVVEE
TPAPGLDADT RQALLDAAER LGRQVAYRSA GTVEYILDAD TGAFYFLEMN TRLQVEHGVT
EAVTGIDLVG WMVEAAAGTL TDLAARRPGH RGHAIQARLY AEDPGKGFQP ATGLLTEVRF
PDAARIETWV ERGSEISPHY DPMIAKLIVH GTDRADALQR LRQALGETAL HGVETNLPYL
RAIAEDETFA AGRATTRYLD RFTYRPASID VLQPGTQTTV QDWPGRVGYW EVGVPPSGPM
DALAFRLGNR IVGNPEGAAG LEMTASGATL QFNTATTIAL TGATMPAELD GSPVPPWQAV
AVPAGARLRL GSAAGPGVRT YLLLRGGVDV PLHLGSRATF TLGRMGGHGG RALQTGDVLH
LGPEPDRPLT ALPANGVPAY GEEWTLRAVY GPHGAPDFFT EADIDTFFTT AWQVHYNSSR
TGVRLIGPKP EWARADGGEA GLHPSNIHDN AYAIGTVDFT GDMPVILGPD GPSLGGFVCP
ATIVTADLWK IGQLRPGDRV RFQAVDAATG QRLAEAQDAA IAQGAPADPP LPDSVSGRVA
PTLRPQATAA EAVTVTYRSA GERYLLVEYG PIVLDLNLRF RVQALLEWLS EQAIPGILEM
TPGVRSLQIH YEPRRLPQER LLTILEEAEG ELRDLAEAEM PSRIVHLPLA WDDSQTRLAT
EKYMQSVRAD APWCPHNIEF IRRINGLPDE AAVKRTVYDA SYLVMGLGDV YLSAPLATPV
DPRHRLVTTK YNPARTWTPE NAVGIGGSYL CIYGMEGPGG YQFVGRTLQI WNRYHVTPAF
EKPWLLRFFD QIRFYEVSEA ELLEMRDAFP RGGLALEIEE TTFSLQRYNR FLRENQASIE
AFKAHQQQAF EAERQRWIAN GQADYEAESE PPPAAGEGVE LGADEQAVAA HVHANLWSLQ
VAEGETVEAG QTLLVLESMK MEIPLCADQA GTVRRLLCRE GAQVAPGQTL LTLTSDGSTP