Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Afer_1699 |
Symbol | |
ID | 8323790 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidimicrobium ferrooxidans DSM 10331 |
Kingdom | Bacteria |
Replicon accession | NC_013124 |
Strand | - |
Start bp | 1784117 |
End bp | 1787419 |
Gene Length | 3303 bp |
Protein Length | 1100 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 644952830 |
Product | carbamoyl-phosphate synthase, large subunit |
Protein accession | YP_003110288 |
Protein GI | 256372464 |
COG category | [E] Amino acid transport and metabolism [F] Nucleotide transport and metabolism |
COG ID | [COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ) |
TIGRFAM ID | [TIGR01369] carbamoyl-phosphate synthase, large subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.54095 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCTAGGG ATCCGAAGGT GGAGTCGGTG CTCGTCATCG GGTCTGGTCC CATCGTCATC GGCCAGGCCA GCGAGTTCGA CTACTCCGGC GTACAGGCGT GCCGGGTCCT TCGAGAGGAA GGGCTCCGCG TCATTTTGGC GAACTCGAAT CCAGCGACGA TCATGACCGA TCCCGAGTTC GCCGACGCGA CCTACATCGA GCCCCTGACG CTCGAGGTGC TCGAGCGCAT CATCGAGGCC GAACGGCCTG ATGCGGTGCT CCCGACGCTC GGCGGCCAGA CGGCGCTGAA TCTCGCGATG GAGCTGGATG CGTCTGGTGT GCTCGAGCGA AGTGGGGTGC GGATGCTCGG GGCGCGACCG GCCTCGATCG AGCTCGCGGA GAACCGCGAT GCATTCCGCC AGCTGCTCAT GAGCATCGAC GAGCAGCTGG CGGTGCGCGG GCGGCTCGTG CGGTCCCTCG AGGAGGGAAG GGACGTCGCC GACGAGCTCG GTTACCCGCT CATGCTGCGA CCCTCGTACA TCCTCGGCGG AGCCGGAACC GGGATCGCGA CCGACCCCAG CTCCTTCGAG GCGATGCTGC GGGCCGGTCT CATCGCCTCG CCGGTCGGCG AGGTGTTGGT CGAGGAGTCG ATCGCCGGTT GGAAGGAGTT CGAGCTCGAG GTCATGCGCG ACGCGAATGA CAACTGCGTC GTCGTGTGTT CGATCGAGAA CGTCGACCCG ATGGGGGTTC ACACTGGTGA CTCGATCACC GTTGCTCCTG CACAGACGCT GACGGATCTC GAGTACCAGC GCATGCGGTC GCTCTCGTTC GAGATCCTCC GACGCGTGGG TGTCGAGACG GGCGGCTCGA ACGTGCAGTT CGCTGTGGAG CCGACCAGCG GCCGCATGGT CGTGGTCGAG ATGAACCCCC GTGTGTCGCG CTCGAGCGCG CTGGCATCGA AGGCCACCGG GTTCCCGATC GCCAAGATCG CGACACGCTT GGCGATCGGC TACACGCTCG ACGAGATCAT GAACGACATC ACGGGTGTCA CGCCAGCCAG TTTCGAGCCG GCACTCGATT ACGTGGTGGT CAAGGTGCCG CGCTGGGTCT TCGAGAAGTT CGAAGGTGCC GAGGGTATTC TCGGGACGCG GATGCAGTCG GTCGGGGAGA CCATGGCGAT CGGCCGTAGT TTCGCCGAGG CGCTGCAGAA GGCGCTCCGC GGTATCGAGC GGTCGCGGGG AGGGTTCGGC GCCGATCCGG CCGAGGTCAC CTGGCAGGCG TACTCCGATG ACGCCCTGGC CACGCTCGTC GCTGTGCCGA CCGAGCAGCG AGTCTTCGCC GTTGGCGAGG CACTCCGGCG CGGTTGGAGC ATCGAGCGCG TTGCCGAGCT GAGCCGGATC GACCCGTGGT TCATCGGGGA AATGGCGGGC ATCGTCGCGC GTGCAGCCGA CATCCGTGGC CGCGATCTCG CCTCACTCGG TGCCGACGAA CTGCTCGATC TGAAGCGATG GGGCTTCTCG GATCTCCAAC TCGCGTGGCT GCTCGGTGTG GACGAGACGG CGGTTCGCGA GCATCGGCAC ACCGTAGGGG TGCGCGCTGT CTACAAGGCG GTCGACACCT GCGCTGGCGA GTTCCCAGCA CGCACGCCGT ACTACTACGG GACCTACGAG GAGGAGAGCG AGACCGTCGG ATCGAATCGG CCAAGTGTGA TCATCATCGG CGCTGGCCCG AATCGCATCG GCCAGGGCAT CGAGTTCGAC TACTGCTGCG TCCATGCAGC ATTCGCGCTT CGTGAGGCCG GTGTCGATGC GATCATGGTC AACTCGAACC CGGAGACGGT CTCGACCGAC TACGACACCT CCTCGCGCCT GTACGTGGAG CCCTTGGTGA CCGAACACGT GCTCGACGTG ATCGCCGAGG AACAGCGTCT CGGCTCACTC CAGGGTGTCA TCGTGTCGCT CGGAGGCCAG ACACCACTCA AGCTGGCTCG AGACATCGAT CCGTCGCTGG TCCTCGGCAC TTCGCCGGAC TCGATCGACG TCGCGGAGGA CCGCCGGCGT TGGTCTGCCC TGTGCGAGCG CCTCGGCATC CGCCAGCCCC CGGGCGGCAC CGTGACCTCG CTCGCCGAAG CGGAGGCGGT GGTCGCTGCG ATCGGCCTGC CCGTGTTGGT GCGCCCGAGC TATGTGCTCG GCGGGCGAGC CATGGAGATC GTCTACTCGG AGGACGAGCT GCGCAGCGCG TTCTCTCGGC TCGTCGATCT GGCGGCCGAG GGCGCGATCT CGCAGGACCG ACCGATCCTG ATCGATCGCT TCCTCGAAGG TGCGATCGAG GTCGACGTCG ACGCGGTTCG CGATCGCGAG GGCGCGTGCT GGATCGGTGC GGTCATGGAG CACGTCGAGG AGGCTGGCGT TCACTCGGGT GACTCGGCGT GCACGATCCC GCCGGTCTCA CTTGCTCCCG CCCTCGTCGC AGAGATCGAG GCCCAGACGC GTGCGATCGC GGACGCACTC GATGTGGTCG GGCTCATCAA CGTACAGTTC GCCGTGGCGG ACGGCACGGT GTTCGTCATC GAGGCCAATC CGCGTGCGTC GCGGACGGTT CCGTTCGTCG CCAAGGCGAC GGGTGTCGCG CTCGTCAAGA TCGCCACGCG TCTCATGCTG GGATCGACCC TGCGTGATCT CGAACGAGAG GGCCTCTTCG TGCCTCGCCG CGTCACCAGT TACGTCGCGG TGAAGGAGGC GGTCCTCCCG TTCGGCAGGT TCCGCGGGGC CGACAGCATC CTCGGGCCCG AGATGCGCTC GACCGGTGAG GTGATGGGGA TCGATCGCAC GATGCCGATG GCGTTCGCCA AGGCACAACT CGCCGCCGGT ACGCGTCTTC CCCAGCAGGG CACGGTCCTC GTGACCGTCG CCGATCGCGA CAAGGAGGCG CTCGTCCCGA TCGCCGAGCG ACTGGTGGCC CACGGGTTCG ACCTTGCTGC GACCGAGGGG ACCGCCTCCT GGCTTCAGGC TCACGGCGTC CCTGTTGCGC GCCTCGTCGC GAAGGTCACC GAGGAAGAGG ATCTCGGCGA GCGTGCTGCG CCGGAGGGTT TCGTCGATGC CGTACGGCTG GTGCGCTCCG GCGAGGTTGC GCTCATCATC AACACGCCGA GAGGGAGTGG CCCGCGGCGA GACGGTTACC GCATCCGCAC GGCAGCGCTC GAAGCCAAGA TCCCGTTGAT CACGACGCTG GAGGCCGCTC GCGCGGCTGT GGCAGCGGTC GAGGCGCTCG CACGTCAGCC GCTGACCGTG CGGCCGCTGC AGGCCTATCA CGAGGAAACC TGA
|
Protein sequence | MPRDPKVESV LVIGSGPIVI GQASEFDYSG VQACRVLREE GLRVILANSN PATIMTDPEF ADATYIEPLT LEVLERIIEA ERPDAVLPTL GGQTALNLAM ELDASGVLER SGVRMLGARP ASIELAENRD AFRQLLMSID EQLAVRGRLV RSLEEGRDVA DELGYPLMLR PSYILGGAGT GIATDPSSFE AMLRAGLIAS PVGEVLVEES IAGWKEFELE VMRDANDNCV VVCSIENVDP MGVHTGDSIT VAPAQTLTDL EYQRMRSLSF EILRRVGVET GGSNVQFAVE PTSGRMVVVE MNPRVSRSSA LASKATGFPI AKIATRLAIG YTLDEIMNDI TGVTPASFEP ALDYVVVKVP RWVFEKFEGA EGILGTRMQS VGETMAIGRS FAEALQKALR GIERSRGGFG ADPAEVTWQA YSDDALATLV AVPTEQRVFA VGEALRRGWS IERVAELSRI DPWFIGEMAG IVARAADIRG RDLASLGADE LLDLKRWGFS DLQLAWLLGV DETAVREHRH TVGVRAVYKA VDTCAGEFPA RTPYYYGTYE EESETVGSNR PSVIIIGAGP NRIGQGIEFD YCCVHAAFAL REAGVDAIMV NSNPETVSTD YDTSSRLYVE PLVTEHVLDV IAEEQRLGSL QGVIVSLGGQ TPLKLARDID PSLVLGTSPD SIDVAEDRRR WSALCERLGI RQPPGGTVTS LAEAEAVVAA IGLPVLVRPS YVLGGRAMEI VYSEDELRSA FSRLVDLAAE GAISQDRPIL IDRFLEGAIE VDVDAVRDRE GACWIGAVME HVEEAGVHSG DSACTIPPVS LAPALVAEIE AQTRAIADAL DVVGLINVQF AVADGTVFVI EANPRASRTV PFVAKATGVA LVKIATRLML GSTLRDLERE GLFVPRRVTS YVAVKEAVLP FGRFRGADSI LGPEMRSTGE VMGIDRTMPM AFAKAQLAAG TRLPQQGTVL VTVADRDKEA LVPIAERLVA HGFDLAATEG TASWLQAHGV PVARLVAKVT EEEDLGERAA PEGFVDAVRL VRSGEVALII NTPRGSGPRR DGYRIRTAAL EAKIPLITTL EAARAAVAAV EALARQPLTV RPLQAYHEET
|
| |