Gene Pisl_1184 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPisl_1184 
SymbolcarB 
ID4617697 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum islandicum DSM 4184 
KingdomArchaea 
Replicon accessionNC_008701 
Strand
Start bp1074653 
End bp1077730 
Gene Length3078 bp 
Protein Length1025 aa 
Translation table11 
GC content56% 
IMG OID639784277 
Productcarbamoyl phosphate synthase large subunit 
Protein accessionYP_930695 
Protein GI119872688 
COG category[E] Amino acid transport and metabolism
[F] Nucleotide transport and metabolism
[I] Lipid transport and metabolism 
COG ID[COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ)
[COG4770] Acetyl/propionyl-CoA carboxylase, alpha subunit 
TIGRFAM ID[TIGR01369] carbamoyl-phosphate synthase, large subunit 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones66 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGAACA TTAGGAAAAT CTTGGTGATA GGGTCCGGCG CTATTAAAAT TGCTGAAGCG 
GCGGAGTTTG ACTACTCGGG CTCGCAGGCC CTGAAGGCGT TTAGGGAGGA GGGGATAGAG
ACGGTGCTTG TCAACCCCAA CATAGCCACT ATACAGACGT CGAAGTTTCT GGCGGATAGG
GTCTACTTCG TGCCTATACA GAGGCAGTTC TTGGCGGAGG TCATAGAGAG GGAGAGGCCT
GACGCAATTG CGTGTGGCTT TGGCGGACAG ACGGCGCTGT CGGCCTGCGT AGATCTACAC
GACTCCGGTG TCTTGGATAA ATACGGCGTT AAGGTTGTGG GAACGCCGAT ACGGGGAATA
AAGAGGGCAC TATCGAGGGA TTTATTTCAA AAGGCCATGA GGGAGGTCGG CATACCGATC
CCGCCCAGTA GCCCCGCGAG ATCGCCCGAG GAGGCGCTGA AGGTCGCGCG GGAGATAGGC
TATCCGGTGG TGGTGAGAGT GAGCTTCAAC CTAGGCGGGG CGGGCGCCTT CGTTGCCAGG
AGTGAGGAGG ACCTAAGGGC CAGGGTGTAT AAGGCCTTCG CCCAATCTGC AATTGGGGAA
GTCCTTGTGG AGAAGTACTT GGAGGGGTGG AAGGAGATAG AGTTTGAGGT TGTGCGCGAC
GCCTACGACA ACGTCGCCGC AGTGGTCTGT ATGGAGAACG TGGACCCCAT GGGCGTACAC
ACGGGGGACT CCATAGTGGT GGCCCCCTGT CTCACTTTGA CAGATGAGGA GTACCAGAAG
GCTAGGAATA TCTCCATCGG CGTGGCGCGC GCCATCGAGC TAGTGGGCGA GGGCAACGTC
CAAGTGGCGG TCAACTACGC TGGACCTGAG CAGTACGCCA TAGAGACAAA CCCACGTATG
TCCCGCTCTA GCGCCCTTGC CTCCAAGGCC TCGGGCTATC CGCTTGCCTT CATCGCAGCT
AAGTTGGCCT TGGGCTACCG TCTGGACGAG GTTTTGAACC AGGTGACGAG GCGGACGGTG
GCGGCGTTTG AGCCCGCGCT TGACTACATA GTGGTCAAAC ACCCGAGGTG GGAGAACGAC
AGATTCGGCG TATCCGAGGG CCTGGGGCCA GAGATGATGT CTATCGGCGA GGCCATGGCG
GTGGGGAGGA CGCTGGAGGA GGCTTGGCAG AAGGCGGTTA GGATGATCGA CATAGGCGAG
CCCGGCCTAG TGGGCGGGCC TATGTTTAAA GAACTGACGC TTGAGGAGGC GAGACGGTGT
CTAGAGGGGT ATAAGCCCTA CTGGCCGATA TGCGCCGCCA AGGCCATGTA CCTAGGCCTA
TCCATAGACG AGGTGTACAG CTACGTGAAG GTGGATAGGT TCTTCCTAAG GGCTATACAA
CGCGTCGTAG AGGCCTACAA GGCGCTTGAG CAAGGCCGGT ACGACCTGGA GGAGCTGAAG
GTGTTGGGCT TCTCAGACAG TCAGATAGCC AAGGCGCTGG GGGTTGAGGA GGAGGAGGTG
AGACGGGCGA GGAGGCGGCC AGTGGTGAAG AGGATAGATA CGCTTGCCGG GGAGTGGCCA
GCCGAGACGA ACTACCTCTA CCTCACCTAT GGCGGCGTAT ACGACGATGA CGTGCCACCG
GCGGACTACT TGGTGGTGGG CGCCGGCGTC TTTAGAATAG GCGTAAGCGT GGAGTTCGAC
TGGTCTACCG TAAACCTGGC GCAGGAGCTG AGAAACAGGG GATTCAAAGT GGCTATCTTG
AACTACAACC CCGAGACGGT ATCGACGGAC TGGGACATAG TGGATAAGCT CTACTTCGAC
GAGATCTCCT ATGAAAGGAT ACTAGACATA GTGGAGAAGG AGGGCAGCGG CATTACGGTT
GTCCTCTATG CGGGCGGCCA GATAGGCCAG AGATTGTATA AGCGTCTTGA GGCGGCGGGA
GTAAAGATCG GGGGGACCCG CGCCGCGTCT ATAGATGTGG CAGAGGACCG GAGTAAGTTT
TCGGAGCTCC TAGAGAAACT CGGTATAAAA CAACCGCCGT GGTTCGCCGC TAGATCCCTA
GAAGAGGCGG CTAAACTCGC CGAGGCGCTG GGCTATCCCG TGTTGGTGAG GCCCAGCTAC
GTCTTAGGCG GCACCTACAT GGCCGTGGCT TACGACAGAG AGGGGCTCCT GAGCTTTCTT
ACAAAGGCGG CTAAAGTGAG CGGGGAGTAC CCGGTGGTCG TGTCCAAGTT CATACCTCGC
GGCATTGAGG CAGAGGTAGA CGCCGTGTCT GACGGAGTTC GGCTCGTCGC CACTCCCATC
GAGCATGTGG AGCCGCCTGG CGTACACTCT GGCGACTCCA CCATGGTGCT CCCGCCGAGG
AGGCTGGAGG AGGGGGCCGT TAAAAAGATG GTCGAAGCAA CGCAGAGGAT CGCTAGCGAA
CTCGGGGTCA AGGGTCCTCT CAACGTCCAG TTTATAGTCT ACGATGACGT GTACGTAATA
GAGGCGAACC TCAGGGTAAG CCGCTCTATG CCCTTTGTAA GCAAGGCCAC GGGGGTGAAC
TACATGTCTC TGACAGCAGA TGTGTTGGTG AATGGCCGTC TCGCCGTAGA TGAGGAGACT
GTGGTGCTTA AGCCGACTAA GTGGTGGGTG AAGTCTCCAC AGTTCTCTTG GTCTAGGCTG
AGGGGGGCAT ACCCGCGGCT GGGGCCTGTT ATGTATAGCA CGGGGGAGGT GGCCTCAAAC
GGGGCCACCT ATGAGGAGGC ATTGCTTAAG AGTTGGCTAT CTGCAACGCC TAATAAAATA
CCGGAGAGAT CTGCCCTAGT ATATACATAC GATAAGCATA GCGAAGAAAC TATTTTACAA
GTTGCGTCTC TACTCTCCAC AAGGCTTGAA GTATATACGC CGGAGCAATT AGGCGAGAAG
ATCGTGGATA TGTTAAAATG GAAGAAGATA GATATTGTGA TGACGGCAGG TGTAACGCCA
GAGAGAGACT TCTTAATTAG GAGGACTGCT GCAGATACCA ATACCCCTCT TGTATTAGAC
GCCACACTAG CTCTAGAGCT CACTAAGGCG TTTATCTGGT ATTATAAAAA CGGGAAGTTT
GAAATATCGC CGTGGTAG
 
Protein sequence
MPNIRKILVI GSGAIKIAEA AEFDYSGSQA LKAFREEGIE TVLVNPNIAT IQTSKFLADR 
VYFVPIQRQF LAEVIERERP DAIACGFGGQ TALSACVDLH DSGVLDKYGV KVVGTPIRGI
KRALSRDLFQ KAMREVGIPI PPSSPARSPE EALKVAREIG YPVVVRVSFN LGGAGAFVAR
SEEDLRARVY KAFAQSAIGE VLVEKYLEGW KEIEFEVVRD AYDNVAAVVC MENVDPMGVH
TGDSIVVAPC LTLTDEEYQK ARNISIGVAR AIELVGEGNV QVAVNYAGPE QYAIETNPRM
SRSSALASKA SGYPLAFIAA KLALGYRLDE VLNQVTRRTV AAFEPALDYI VVKHPRWEND
RFGVSEGLGP EMMSIGEAMA VGRTLEEAWQ KAVRMIDIGE PGLVGGPMFK ELTLEEARRC
LEGYKPYWPI CAAKAMYLGL SIDEVYSYVK VDRFFLRAIQ RVVEAYKALE QGRYDLEELK
VLGFSDSQIA KALGVEEEEV RRARRRPVVK RIDTLAGEWP AETNYLYLTY GGVYDDDVPP
ADYLVVGAGV FRIGVSVEFD WSTVNLAQEL RNRGFKVAIL NYNPETVSTD WDIVDKLYFD
EISYERILDI VEKEGSGITV VLYAGGQIGQ RLYKRLEAAG VKIGGTRAAS IDVAEDRSKF
SELLEKLGIK QPPWFAARSL EEAAKLAEAL GYPVLVRPSY VLGGTYMAVA YDREGLLSFL
TKAAKVSGEY PVVVSKFIPR GIEAEVDAVS DGVRLVATPI EHVEPPGVHS GDSTMVLPPR
RLEEGAVKKM VEATQRIASE LGVKGPLNVQ FIVYDDVYVI EANLRVSRSM PFVSKATGVN
YMSLTADVLV NGRLAVDEET VVLKPTKWWV KSPQFSWSRL RGAYPRLGPV MYSTGEVASN
GATYEEALLK SWLSATPNKI PERSALVYTY DKHSEETILQ VASLLSTRLE VYTPEQLGEK
IVDMLKWKKI DIVMTAGVTP ERDFLIRRTA ADTNTPLVLD ATLALELTKA FIWYYKNGKF
EISPW