Gene Pars_0144 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_0144 
SymbolcarB 
ID5055903 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp132820 
End bp135897 
Gene Length3078 bp 
Protein Length1025 aa 
Translation table11 
GC content60% 
IMG OID640467723 
Productcarbamoyl phosphate synthase large subunit 
Protein accessionYP_001152411 
Protein GI145590409 
COG category[E] Amino acid transport and metabolism
[F] Nucleotide transport and metabolism 
COG ID[COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ) 
TIGRFAM ID[TIGR01369] carbamoyl-phosphate synthase, large subunit 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTGATA TTAGGAAAGT TCTCATAATT GGCTCAGGCG CCATAAAGGT GGCAGAGGCT 
GCGGAGTTCG ACTACTCGGG GTCGCAGGCT TTGAAGGCCT TTAGGGAGGA GGGGATATCA
ACTGTGTTAG TAAATCCCAA CATCGCCACG ATACAGACGT CGAAGTTGCT TGCCGACCGC
GTATATTTTG TGCCGATTGC CAGACATTTC CTGGAGCAGG TTATCGAGAG GGAGAGGCCC
GATGCCATAG CCTGCGGCTT CGGCGGCCAG ACGGCGCTTT CTGCATGTGT TGAGCTATAC
GACTCCGGCA TCTTGTCGAA ATACGGGGTT AGGGTAATAG GCACTCCAGT TAGGGGGATA
AAACGGGCCT TGTCCAGGGA CCTCTTCCAG AAGGCCATGA AAGAGGCCGG CATTCCCGTT
CCGCCTAGTA GCCCCGCCAA GTCGCCAGAG GAGGCTCTTG AGATCGCTAG GGGGCTGGGC
TACCCCATCG TTGTGCGCGT CTCCTTCAAC CTTGGCGGAG CCGGGGCCTT CGTTGCGAGG
AGCGAGGAGG CGCTGAGGGC GAGGATATAC AAGGCCTTCG CCCAGTCGGC CATTGGGGAG
GTCCTCGTGG AGAAGTACCT AGAGGGCTGG AAGGAGGTGG AGTTCGAGGT GGTGAGAGAC
GCCTACGACA ACGTAGCCGC CGTGGTGTGC ATGGAGAACG TGGACCCCAT GGGCGTCCAC
ACAGGCGACT CCATTGTCGT GGCGCCGTGC CTCACACTAA CTGACGAGGA GTACCAGAAG
GCTAGGGACA TCTCGATAGG GGTGGCCAGG TCGATTGAGC TGGTGGGCGA GGGCAACGTC
CAAGTGGCGG TCAACTACGC CGGACCTGAG CAGTACGCCA TTGAGACCAA CCCCCGTATG
TCCCGCTCAA GCGCCCTTGC CTCCAAGGCC TCTGGGTACC CCCTGGCGTA CATCGCGGCT
AAGCTTGCCC TCGGCTACCG CCTGGACGAG GTGATGAACC AGGTGACGAG GCGGACGGTG
GCCGCATTCG AGCCGTCGCT GGACTACATA GTTGTGAAGC ACCCGCGCTG GGAGAACGAC
CGGTTCGGAG TTACGGAGGG CCTCGGCCCC GAGATGATGT CCATCGGCGA GGCGATGGGC
ATAGGAAGGA CGCTGGAGGA GGCGTGGCAG AAGGCCATCC GCATGATAGA CATCGGCGAG
CCGGGCTTGG TGGGAGGCCC CATGTTCGAA AGCCTCACGC TGGAGGAGGC CCTTAGGTGC
GTGGAGAGGT ACTTGCCGTA CTGGCCCATA TGCGCGGCTA AGGCGCTCTA CCTTGGCGCG
TCGGTGGAGG ACATATACCA GCGGAATAGA GTAGACAAGT TCTTCCTAAA CGCCATAAAA
CGCGTCGTGG ATTCCTACAA AGGGCTTGAG GCCGGCTCCT ACGACCTCGA GGAGTTGAAG
ATCTTGGGCT TCTCCGACGC CCAGATCGCC AAGGCCTTGA AGAAGCCCGT CGACGAGGTG
AGGAGGGCGA GGAGGGCCCC CGTGGTGAAG AAGATAGACA CCCTAGCGGG GGAGTGGCCG
GCGGATACCA ACTACCTCTA CCTAACCTAC GGCGGCCAAT ACGACGACGA GACGCCTAGG
GCGGACTTCC TCGTGGTGGG GGCCGGCGTG TTCAGAATCG GCGTGTCGGT GGAGTTCGAC
TGGGCCACGG TGAACTTGGC AAAGGAGCTG AGGGACAGGG GGTACCGCGT CGCGATTCTC
AACTACAACC CCGAGACCGT CTCCACCGAC TGGGACGTGG TGGACAAGCT TTATTTCGAC
GAGATAACGG CTGAGAGGGT GCTGGACATT GTGGAAAAGG AGGGCCGCGA CGTGGTAGTA
GTCCTATACG CCGGGGGGCA GATAGGGCAG AGGCTATACG CCCCGCTTGA AAAGGCGGGT
GTCAAAATCG GCGGCACCAA GGCGCGCTCT ATCGACGCGG CGGAGGACCG GAGCAAGTTC
TCAAAGCTAC TTGACAGGCT CGGGATTAAG CAACCTCCCT GGCTCTACGC CTCCAGCGTC
GAGGAGGCGG TGAAGCTGGC GGAGGATTTG GGATACCCCG TCTTGGTGAG GCCTAGCTAC
GTCCTCGGCG GCACCTATAT GGCTGTGGCG AACAACGCGG AGGAGCTGAG AAGCTTCTTG
GCAAAGGCGG CTAAGGTCAG CGGCGAGCAC CCAGTGGTGA TATCCAAGTT CATGCCCAGG
GGGATAGAGG CGGAGGTAGA CGCGGTTTCA GACGGCGTGG GGATAGTGGC AACCCCAATC
GAACACGTTG AGCCTCCTGG CATACACTCC GGCGACTCGA CCATGGTCCT GCCGCCGCGG
AGGCTGGAGG AGTGGGCTGT GCGGAGGATG ATAGACATAG CCCACATCAT TGCCAGAGAG
CTTGAGGTAA AGGGGCCTAT GAACGTCCAG TTTCTAGTAC AGGACGACGT CTATGTAATA
GAGGCGAACC TCCGCGCTAG CCGGTCCATG CCACTGGTAA GCAAGGCCAC CGGCGTCAAC
TACATGTCCC TAGTCGCAGA CGTCTTAGTC AACGGCCGCC TCGCGGTGGA CGAGGAGAGG
GTGGTCTTAA AGCCCTCCAA GTGGTGGGTG AAGTCGCCCC AGTTCTCCTG GGCCCGCCTA
AGAGGGGCAT ACCCGCGCCT CGGCCCCGTG ATGTACTCAA CAGGCGAGGT GGCCTCCAAC
AGCGCTGTGT TTGAAGAGGC ATTGCTCAAA AGCTGGCTCT CCGCCACGCC CAACAGAATA
CCGAAGAGGA ACGCCCTTGT CTATACCTAC GACCCCCATC ACGCCGAGCT GATCGGACAG
GCGGCCAGCC TCCTCTCTGC CAAGCTTCGG GTATATTCAC CGGAGGAGCT GGGGGATAAA
ATACTGGACG AGCTGAGGTG GCGCAGAATC GACATAGTAG TTACGGCGGG TACCACGCCC
GAAAAGGACT ATCACATTAG GAGGACGGCG GCTGACACAA ACACGCCTCT TGTGCTGGAC
TCTACCCTCG CCGTAGAGCT CGCAAAGGCC TTTCTCTGGT ATTATAAAAA CGGGAAACTA
GGAGTAGAAC CATGGTGA
 
Protein sequence
MPDIRKVLII GSGAIKVAEA AEFDYSGSQA LKAFREEGIS TVLVNPNIAT IQTSKLLADR 
VYFVPIARHF LEQVIERERP DAIACGFGGQ TALSACVELY DSGILSKYGV RVIGTPVRGI
KRALSRDLFQ KAMKEAGIPV PPSSPAKSPE EALEIARGLG YPIVVRVSFN LGGAGAFVAR
SEEALRARIY KAFAQSAIGE VLVEKYLEGW KEVEFEVVRD AYDNVAAVVC MENVDPMGVH
TGDSIVVAPC LTLTDEEYQK ARDISIGVAR SIELVGEGNV QVAVNYAGPE QYAIETNPRM
SRSSALASKA SGYPLAYIAA KLALGYRLDE VMNQVTRRTV AAFEPSLDYI VVKHPRWEND
RFGVTEGLGP EMMSIGEAMG IGRTLEEAWQ KAIRMIDIGE PGLVGGPMFE SLTLEEALRC
VERYLPYWPI CAAKALYLGA SVEDIYQRNR VDKFFLNAIK RVVDSYKGLE AGSYDLEELK
ILGFSDAQIA KALKKPVDEV RRARRAPVVK KIDTLAGEWP ADTNYLYLTY GGQYDDETPR
ADFLVVGAGV FRIGVSVEFD WATVNLAKEL RDRGYRVAIL NYNPETVSTD WDVVDKLYFD
EITAERVLDI VEKEGRDVVV VLYAGGQIGQ RLYAPLEKAG VKIGGTKARS IDAAEDRSKF
SKLLDRLGIK QPPWLYASSV EEAVKLAEDL GYPVLVRPSY VLGGTYMAVA NNAEELRSFL
AKAAKVSGEH PVVISKFMPR GIEAEVDAVS DGVGIVATPI EHVEPPGIHS GDSTMVLPPR
RLEEWAVRRM IDIAHIIARE LEVKGPMNVQ FLVQDDVYVI EANLRASRSM PLVSKATGVN
YMSLVADVLV NGRLAVDEER VVLKPSKWWV KSPQFSWARL RGAYPRLGPV MYSTGEVASN
SAVFEEALLK SWLSATPNRI PKRNALVYTY DPHHAELIGQ AASLLSAKLR VYSPEELGDK
ILDELRWRRI DIVVTAGTTP EKDYHIRRTA ADTNTPLVLD STLAVELAKA FLWYYKNGKL
GVEPW