Gene Information |
Locus tag | Pars_0144 |
Symbol | carB |
ID | 5055903 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | + |
Start bp | 132820 |
End bp | 135897 |
Gene Length | 3078 bp |
Protein Length | 1025 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 640467723 |
Product | carbamoyl phosphate synthase large subunit |
Protein accession | YP_001152411 |
Protein GI | 145590409 |
COG category | [E] Amino acid transport and metabolism [F] Nucleotide transport and metabolism |
COG ID | [COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ) |
TIGRFAM ID | [TIGR01369] carbamoyl-phosphate synthase, large subunit |
|
|
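The coordinate and length fields above are consistent with one another, which can be checked with a short sketch. It assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function name `cds_lengths` is hypothetical and not part of any database API.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumption: coordinates are 1-based and inclusive, and the final codon
    is a stop codon that does not contribute an amino acid.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the terminal stop codon
    return gene_len, protein_len


# Values taken from the record for Pars_0144.
gene_len, protein_len = cds_lengths(132820, 135897)
print(gene_len, protein_len)
```

With the record's coordinates this gives the 3078 bp gene length and 1025 aa protein length listed above.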
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 39 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTGATA TTAGGAAAGT TCTCATAATT GGCTCAGGCG CCATAAAGGT GGCAGAGGCT GCGGAGTTCG ACTACTCGGG GTCGCAGGCT TTGAAGGCCT TTAGGGAGGA GGGGATATCA ACTGTGTTAG TAAATCCCAA CATCGCCACG ATACAGACGT CGAAGTTGCT TGCCGACCGC GTATATTTTG TGCCGATTGC CAGACATTTC CTGGAGCAGG TTATCGAGAG GGAGAGGCCC GATGCCATAG CCTGCGGCTT CGGCGGCCAG ACGGCGCTTT CTGCATGTGT TGAGCTATAC GACTCCGGCA TCTTGTCGAA ATACGGGGTT AGGGTAATAG GCACTCCAGT TAGGGGGATA AAACGGGCCT TGTCCAGGGA CCTCTTCCAG AAGGCCATGA AAGAGGCCGG CATTCCCGTT CCGCCTAGTA GCCCCGCCAA GTCGCCAGAG GAGGCTCTTG AGATCGCTAG GGGGCTGGGC TACCCCATCG TTGTGCGCGT CTCCTTCAAC CTTGGCGGAG CCGGGGCCTT CGTTGCGAGG AGCGAGGAGG CGCTGAGGGC GAGGATATAC AAGGCCTTCG CCCAGTCGGC CATTGGGGAG GTCCTCGTGG AGAAGTACCT AGAGGGCTGG AAGGAGGTGG AGTTCGAGGT GGTGAGAGAC GCCTACGACA ACGTAGCCGC CGTGGTGTGC ATGGAGAACG TGGACCCCAT GGGCGTCCAC ACAGGCGACT CCATTGTCGT GGCGCCGTGC CTCACACTAA CTGACGAGGA GTACCAGAAG GCTAGGGACA TCTCGATAGG GGTGGCCAGG TCGATTGAGC TGGTGGGCGA GGGCAACGTC CAAGTGGCGG TCAACTACGC CGGACCTGAG CAGTACGCCA TTGAGACCAA CCCCCGTATG TCCCGCTCAA GCGCCCTTGC CTCCAAGGCC TCTGGGTACC CCCTGGCGTA CATCGCGGCT AAGCTTGCCC TCGGCTACCG CCTGGACGAG GTGATGAACC AGGTGACGAG GCGGACGGTG GCCGCATTCG AGCCGTCGCT GGACTACATA GTTGTGAAGC ACCCGCGCTG GGAGAACGAC CGGTTCGGAG TTACGGAGGG CCTCGGCCCC GAGATGATGT CCATCGGCGA GGCGATGGGC ATAGGAAGGA CGCTGGAGGA GGCGTGGCAG AAGGCCATCC GCATGATAGA CATCGGCGAG CCGGGCTTGG TGGGAGGCCC CATGTTCGAA AGCCTCACGC TGGAGGAGGC CCTTAGGTGC GTGGAGAGGT ACTTGCCGTA CTGGCCCATA TGCGCGGCTA AGGCGCTCTA CCTTGGCGCG TCGGTGGAGG ACATATACCA GCGGAATAGA GTAGACAAGT TCTTCCTAAA CGCCATAAAA CGCGTCGTGG ATTCCTACAA AGGGCTTGAG GCCGGCTCCT ACGACCTCGA GGAGTTGAAG ATCTTGGGCT TCTCCGACGC CCAGATCGCC AAGGCCTTGA AGAAGCCCGT CGACGAGGTG AGGAGGGCGA GGAGGGCCCC CGTGGTGAAG AAGATAGACA CCCTAGCGGG GGAGTGGCCG GCGGATACCA ACTACCTCTA CCTAACCTAC GGCGGCCAAT ACGACGACGA GACGCCTAGG GCGGACTTCC TCGTGGTGGG GGCCGGCGTG TTCAGAATCG GCGTGTCGGT GGAGTTCGAC TGGGCCACGG TGAACTTGGC AAAGGAGCTG AGGGACAGGG GGTACCGCGT CGCGATTCTC AACTACAACC CCGAGACCGT CTCCACCGAC TGGGACGTGG TGGACAAGCT TTATTTCGAC 
GAGATAACGG CTGAGAGGGT GCTGGACATT GTGGAAAAGG AGGGCCGCGA CGTGGTAGTA GTCCTATACG CCGGGGGGCA GATAGGGCAG AGGCTATACG CCCCGCTTGA AAAGGCGGGT GTCAAAATCG GCGGCACCAA GGCGCGCTCT ATCGACGCGG CGGAGGACCG GAGCAAGTTC TCAAAGCTAC TTGACAGGCT CGGGATTAAG CAACCTCCCT GGCTCTACGC CTCCAGCGTC GAGGAGGCGG TGAAGCTGGC GGAGGATTTG GGATACCCCG TCTTGGTGAG GCCTAGCTAC GTCCTCGGCG GCACCTATAT GGCTGTGGCG AACAACGCGG AGGAGCTGAG AAGCTTCTTG GCAAAGGCGG CTAAGGTCAG CGGCGAGCAC CCAGTGGTGA TATCCAAGTT CATGCCCAGG GGGATAGAGG CGGAGGTAGA CGCGGTTTCA GACGGCGTGG GGATAGTGGC AACCCCAATC GAACACGTTG AGCCTCCTGG CATACACTCC GGCGACTCGA CCATGGTCCT GCCGCCGCGG AGGCTGGAGG AGTGGGCTGT GCGGAGGATG ATAGACATAG CCCACATCAT TGCCAGAGAG CTTGAGGTAA AGGGGCCTAT GAACGTCCAG TTTCTAGTAC AGGACGACGT CTATGTAATA GAGGCGAACC TCCGCGCTAG CCGGTCCATG CCACTGGTAA GCAAGGCCAC CGGCGTCAAC TACATGTCCC TAGTCGCAGA CGTCTTAGTC AACGGCCGCC TCGCGGTGGA CGAGGAGAGG GTGGTCTTAA AGCCCTCCAA GTGGTGGGTG AAGTCGCCCC AGTTCTCCTG GGCCCGCCTA AGAGGGGCAT ACCCGCGCCT CGGCCCCGTG ATGTACTCAA CAGGCGAGGT GGCCTCCAAC AGCGCTGTGT TTGAAGAGGC ATTGCTCAAA AGCTGGCTCT CCGCCACGCC CAACAGAATA CCGAAGAGGA ACGCCCTTGT CTATACCTAC GACCCCCATC ACGCCGAGCT GATCGGACAG GCGGCCAGCC TCCTCTCTGC CAAGCTTCGG GTATATTCAC CGGAGGAGCT GGGGGATAAA ATACTGGACG AGCTGAGGTG GCGCAGAATC GACATAGTAG TTACGGCGGG TACCACGCCC GAAAAGGACT ATCACATTAG GAGGACGGCG GCTGACACAA ACACGCCTCT TGTGCTGGAC TCTACCCTCG CCGTAGAGCT CGCAAAGGCC TTTCTCTGGT ATTATAAAAA CGGGAAACTA GGAGTAGAAC CATGGTGA
|
Protein sequence | MPDIRKVLII GSGAIKVAEA AEFDYSGSQA LKAFREEGIS TVLVNPNIAT IQTSKLLADR VYFVPIARHF LEQVIERERP DAIACGFGGQ TALSACVELY DSGILSKYGV RVIGTPVRGI KRALSRDLFQ KAMKEAGIPV PPSSPAKSPE EALEIARGLG YPIVVRVSFN LGGAGAFVAR SEEALRARIY KAFAQSAIGE VLVEKYLEGW KEVEFEVVRD AYDNVAAVVC MENVDPMGVH TGDSIVVAPC LTLTDEEYQK ARDISIGVAR SIELVGEGNV QVAVNYAGPE QYAIETNPRM SRSSALASKA SGYPLAYIAA KLALGYRLDE VMNQVTRRTV AAFEPSLDYI VVKHPRWEND RFGVTEGLGP EMMSIGEAMG IGRTLEEAWQ KAIRMIDIGE PGLVGGPMFE SLTLEEALRC VERYLPYWPI CAAKALYLGA SVEDIYQRNR VDKFFLNAIK RVVDSYKGLE AGSYDLEELK ILGFSDAQIA KALKKPVDEV RRARRAPVVK KIDTLAGEWP ADTNYLYLTY GGQYDDETPR ADFLVVGAGV FRIGVSVEFD WATVNLAKEL RDRGYRVAIL NYNPETVSTD WDVVDKLYFD EITAERVLDI VEKEGRDVVV VLYAGGQIGQ RLYAPLEKAG VKIGGTKARS IDAAEDRSKF SKLLDRLGIK QPPWLYASSV EEAVKLAEDL GYPVLVRPSY VLGGTYMAVA NNAEELRSFL AKAAKVSGEH PVVISKFMPR GIEAEVDAVS DGVGIVATPI EHVEPPGIHS GDSTMVLPPR RLEEWAVRRM IDIAHIIARE LEVKGPMNVQ FLVQDDVYVI EANLRASRSM PLVSKATGVN YMSLVADVLV NGRLAVDEER VVLKPSKWWV KSPQFSWARL RGAYPRLGPV MYSTGEVASN SAVFEEALLK SWLSATPNRI PKRNALVYTY DPHHAELIGQ AASLLSAKLR VYSPEELGDK ILDELRWRRI DIVVTAGTTP EKDYHIRRTA ADTNTPLVLD STLAVELAKA FLWYYKNGKL GVEPW
|
| |
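The sequences in this record are printed in space-separated blocks of 10 characters, so they usually need the whitespace stripped before any downstream use. The following is a minimal sketch of that clean-up plus a GC-fraction calculation. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the sample string is only the first three blocks of the gene sequence, so its GC fraction is not expected to match the 60% reported for the full gene.

```python
def clean_sequence(grouped: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First three 10-base blocks of the gene sequence, copied from the record.
first_blocks = "ATGCCTGATA TTAGGAAAGT TCTCATAATT"
seq = clean_sequence(first_blocks)
print(len(seq), round(gc_fraction(seq), 2))
```

Applying the same two helpers to the complete gene sequence would be one way to confirm the Gene Length and GC content fields.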