Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hhal_1775 |
Symbol | |
ID | 4711004 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | - |
Start bp | 1947429 |
End bp | 1950650 |
Gene Length | 3222 bp |
Protein Length | 1073 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 639856245 |
Product | carbamoyl-phosphate synthase, large subunit |
Protein accession | YP_001003341 |
Protein GI | 121998554 |
COG category | [E] Amino acid transport and metabolism [F] Nucleotide transport and metabolism |
COG ID | [COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ) |
TIGRFAM ID | [TIGR01369] carbamoyl-phosphate synthase, large subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0103005 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGAAAA GAACTGACAT CCAATCGATC CTGATCATCG GCGCCGGCCC CATCGTCATC GGCCAGGCGT GCGAGTTCGA CTACTCCGGT GCGCAGGCCT GCAAGGCCCT GCGCGAGGAG GGCTATCGGG TCATCCTGGT GAACTCCAAC CCGGCGACCA TCATGACCGA CCCGGAGACC GCCGACGCGG TCTATATCGA ACCGGTGGAG TGGCAGACGG TCTCGCGGAT CATCGAGCGG GAGCAGCCGG ATGCGGTGCT GCCCACCATG GGCGGGCAGA CGGCGCTGAA TTGTGCCCTC GATCTGGTCA AGCACGGCGT GCTGGAGCGA TACGGCGTCG AGATGATTGG CGCCAGCCGG GAGGCGATCG ACAAGGCTGA GGAGCGCGAG GCGTTTCGCG CGGCGATGGC CCGCATCGGT CTGGAGACGC CGCGGGCCGA GCTGGCCCGT TCCATGGCCG AGGCCCAGGC GGCGCAGGCA CGCATGGGCT TCCCGGTGAT CATCCGCCCC TCCTACACCC TGGGCGGCTC CGGCGGCGGC ATCGCCTACA ACCGCGAGGA GTTCAACGAG ATCGTCGAGC GCGGGCTCGA CCTCTCCTAC ACCAACGAGG TGCTCCTCGA GGAGTCGGTG CTCGGCTGGA AGGAGTATGA GATGGAGGTC GTGCGGGACC GGCACGACAA CGCCATCATC GTCTGCTCCA TCGAGAACCT CGACCCCATG GGGGTGCACA CCGGCGACTC CACCACCATT GCACCGGCGC AGACCCTGAC CGATAAGGAG TACCAGCTGA TGCGCGACGC ATCGCTGGCG GTGCTGCGCG AGATCGGTGT GGAGACCGGC GGATCCAACG TCCAGTTCGC CATCAACCCG GACAACGGGC GCATGGTGAT CATCGAGATG AATCCGCGGG TGTCGCGCTC CTCGGCGCTG GCCTCCAAGG CGACCGGCTT CCCCATCGCC AAGGTGGCGG CCAAGCTCGC CGTCGGTTAC ACGCTGGATG AGCTGCGCAA CGAGATCACC GGCGGGGCGA CGCCGGCCTC CTTCGAGCCG ACCATCGACT ACGTGGTCAC CAAGATCCCG CGTTTCACCT TCGAGAAGTT CCCCCAGGCC GAGTGCTACC TGACGACACA GATGAAGTCG GTGGGCGAGG TCATGGCCAT CGGGCGGACC TTCCAGGAGT CCTTCCAGAA GGCCCTGCGG GGTCTGGAGC AGGACCTTTC CGGGCTCGAC GAGCGGCTCG ATCGCAGCCG TCAGGATGTG CGCGATACGG TGCGCCACTC GTTGCGTCAG CCGACGCCCG AACGGGTCCT GCATCTGGGT GACGCCTTCC GGGTCGGCTT TACCCTGGAC GAGGTCCACG GGATGACGGC CATCGATCCG TGGTTCCTGG CGCAGATCGA GGAACTGATC GCCGTGGAGG GGCAGGTCGC CGCGAGCGCG CTGGACGATT GTGATGCCGG TGCCTTACTG CGGCTCAAGC GCCGCGGTTT CTCCGATGCC CGGTTGGCCA GCCTGTGGGG CGTTACCGAG GCGCAGGTGC GGCAGCGTCG CCGTGAGCTC GGCGTGCGGC CGGTGTTCAA GCGGGTGGAC TCCTGCGCCG CCGAGTTCCC CACCGCCACG GCGTACCTCT ACTCGACCTA CGAGGAGGAG TGCGAGGCCG AGCCCACCGG GCGCAAGAAG ATCATGGTCC TTGGTGGCGG CCCGAATCGC ATCGGCCAGG GGATCGAGTT CGACTACTGC TGTGTCCACG CATCGCTGTC GCTGCGCGAG GACGGCTACG AGACCATCAT GGTCAACTGC AACCCCGAGA CGGTCTCGAC GGACTACGAC ACTTCGGATC GGCTGTACTT CGAGCCGCTG ACCCTGGAGG ACGTGCTCGA GGTGGTGGAG ACCGAGCAGC CGGATGGGGT GGTCGTCCAG TACGGTGGGC AGACGCCGCT GAAGCTTGCC CGCGAGCTGG AGGCTGCCGG GACACCGATC ATCGGCACCA GCCCGGACTC CATCGACCTG GCCGAAGACC GGGAGCGGTT CCAGGAGCTC ATCGGACGGA TCGACCTGAT GCAGCCGCCG AACCGCACCG CCCGGACCGA GACCGAGGCG CTTCAGCTGG CTGCCGAGAT CGGTTACCCG CTGGTGGTGC GCCCTTCGTA CGTGCTCGGC GGGCGGGCGA TGGAGATCGT CTACGAGGAG AGCGAGCTGC GCCAGTACAT GAATGAGGCG GTGCGGGTCT CGCACAACTC GCCCGTCCTG CTTGACCGCT TCCTCGACGA CGCCGTGGAG GTGGATGTGG ACGCCGTCAG CGACGGCGAC CAGGTGGTCA TCGGCGGGAT CATGCAGCAC ATCGAGCAGG CCGGCGTCCA CTCCGGGGAC TCCGCCTGCT CCATCCCGCC CTACACCCTG GGGCAGGATG TGCAGGACCG GATTCGCGAG CAGGTGCGGC TGCTGGCCCG GGAGCTCGGT GTGGTCGGAC TGATGAACGT GCAGTTCGCC ATCCAGGGGC AGCGCATCTT CCTCCTCGAG GTCAATCCGC GTGCCTCGCG GACGGTGCCG TACGTCTCCA AGGCCTGCGG TGTGCCCCTG GCCAAGGTGG CTGCGCGGTG CATGGCCGGC CGGACGCTGG CCGAGCAGGG GGTGGTGAGC GAAGTCATTC CCAACTACTA TTCGGTCAAA GAGGCGGTCT TCCCGTTCCT CAAATTCCCC GGTGTCGATC CCATCCTGGG TCCGGAGATG AAATCTACCG GAGAGGTGAT GGGTATTGGC GCCTGTTTCG GAGAGGCCTA CGCCAAGGCG CAGCTGGCTG CGGGGGTGAC CCTGCCGCGG GGCGGCTGTG CCTTTGTCAG CGTGCGTGAA GTGGACAAGG AGGCAGCGGT GGAGGTGGCG CGGGACTTGG TCCGACGCGG TTTCCGCTTG ATCGCCACCC ATGGCACAGC GGCCGCCCTC GAAGAGGCGG GCCTGGAGGT GCGCCGGATC AACAAGGTCA TTGAGGGACG GCCGCATGTC GTGGACGCCA TCAAGAACGA CGAGATCGAC CTGATCGTGA ACACCACCGA GGGGCGGCAG GCCATCGCCG ACTCCTACTC GATCCGCCGC GAGGCGCTGC AGCGCAAGGT CTGTTACACG ACGACCATCG CGGGCGCTCG GGCGACGTGC CTGGCGCTGG ATCACATGAA GGACTGGGAG GCCCGCCCCC TCGATGCCCT GCACAGGGAG ATGACGGGAT GA
|
Protein sequence | MPKRTDIQSI LIIGAGPIVI GQACEFDYSG AQACKALREE GYRVILVNSN PATIMTDPET ADAVYIEPVE WQTVSRIIER EQPDAVLPTM GGQTALNCAL DLVKHGVLER YGVEMIGASR EAIDKAEERE AFRAAMARIG LETPRAELAR SMAEAQAAQA RMGFPVIIRP SYTLGGSGGG IAYNREEFNE IVERGLDLSY TNEVLLEESV LGWKEYEMEV VRDRHDNAII VCSIENLDPM GVHTGDSTTI APAQTLTDKE YQLMRDASLA VLREIGVETG GSNVQFAINP DNGRMVIIEM NPRVSRSSAL ASKATGFPIA KVAAKLAVGY TLDELRNEIT GGATPASFEP TIDYVVTKIP RFTFEKFPQA ECYLTTQMKS VGEVMAIGRT FQESFQKALR GLEQDLSGLD ERLDRSRQDV RDTVRHSLRQ PTPERVLHLG DAFRVGFTLD EVHGMTAIDP WFLAQIEELI AVEGQVAASA LDDCDAGALL RLKRRGFSDA RLASLWGVTE AQVRQRRREL GVRPVFKRVD SCAAEFPTAT AYLYSTYEEE CEAEPTGRKK IMVLGGGPNR IGQGIEFDYC CVHASLSLRE DGYETIMVNC NPETVSTDYD TSDRLYFEPL TLEDVLEVVE TEQPDGVVVQ YGGQTPLKLA RELEAAGTPI IGTSPDSIDL AEDRERFQEL IGRIDLMQPP NRTARTETEA LQLAAEIGYP LVVRPSYVLG GRAMEIVYEE SELRQYMNEA VRVSHNSPVL LDRFLDDAVE VDVDAVSDGD QVVIGGIMQH IEQAGVHSGD SACSIPPYTL GQDVQDRIRE QVRLLARELG VVGLMNVQFA IQGQRIFLLE VNPRASRTVP YVSKACGVPL AKVAARCMAG RTLAEQGVVS EVIPNYYSVK EAVFPFLKFP GVDPILGPEM KSTGEVMGIG ACFGEAYAKA QLAAGVTLPR GGCAFVSVRE VDKEAAVEVA RDLVRRGFRL IATHGTAAAL EEAGLEVRRI NKVIEGRPHV VDAIKNDEID LIVNTTEGRQ AIADSYSIRR EALQRKVCYT TTIAGARATC LALDHMKDWE ARPLDALHRE MTG
|
| |