Gene Pars_0143 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_0143 
Symbol 
ID5055877 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp131745 
End bp132827 
Gene Length1083 bp 
Protein Length360 aa 
Translation table11 
GC content58% 
IMG OID640467722 
Productcarbamoyl phosphate synthase small subunit 
Protein accessionYP_001152410 
Protein GI145590408 
COG category[E] Amino acid transport and metabolism
[F] Nucleotide transport and metabolism 
COG ID[COG0505] Carbamoylphosphate synthase small subunit 
TIGRFAM ID[TIGR01368] carbamoyl-phosphate synthase, small subunit 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCACATTG ATGTTAGTGG GGCGAATAAT AGGGGGTACC TCGTACTTGA AGACGGCACG 
GTTTTTGTAG GCAGACTCAT CGGGGCAGAA AAGACTGCAA TAGGTGAGGT TGTCTTCACC
ACGGCGGTGG TGGGCTATCC CCAAATCCTA ACAGATCCCA GCTACAAAGG CCAGATAATA
GTCTTCACAA AGCCACTGGT GGGCAATTAC GGCGTTTCGG AAGATCAGAT GGAGTCAGAC
GGCGTAAAGG CCGAGGGGGT TGTCCTCTTC GAGGCGACGA AGCCCAGCCA CTACAAGTCC
GTAATGACTC TGGAAGAGTG GCTTGCCGCG TCGGGCATCC CCGGAGTTGC CCGGGTAGAC
ACGAGAGCCC TTGTTCTAAA GCTAAGGGAG CACGGGGTGA TGATGGGGGC GGTGGGGCCT
GAAGAGCCGG GCGCCCTTCT AGAGGCGCTA CGCAAGTCCC CCCGCTACGA GGAGTTGTCG
TACGTAGATC TGGTCTCGGT GCAGGAGCCC GTCGAGCTGG GAGAGGGGAG GCTCTGCGTA
GGCATAGTAG ACTGCGGCGT GAAGAAGTCG ATTGTAAAGG AGTTCCTCAA GAGAGGTGTT
AGAGTCCGGC TGGTGCCGTG CAGGAGGCCA GAGGCGGCGT TCGACTGCGA CGGCTTGTTC
TTAAGCAACG GGCCGGGGAA CCCCCAACTT CTAGACTCCC TCGCGTCAAA AGTGGCGCAG
TACGCAGAGT ATAAAAAGCC GCTTATGGGG ATCTGCCTAG GGCACCAGGT AATCGCGATG
GCGTTCGGCG CCCGCATCTA CAAACTGAAA TTCGGCCACA GGGCGAGCAA CAAGCCCGTG
AGGGATCTCC GCTTCACGGG GAAGACCTAC ATAACAACCC ACAACCACGG CTATGCCGTG
GATCCAGAAG GGACTGAGCT AAAGGTCTGG GCTGTTCAAC CAGACGACGG GACGGTGGAG
GGGCTCTACC ACGAACGCCA GCCGGTGTTC ACTACGCAGT TCCACCCCGA GGCCTCGCCA
GGACCCCGCG ACACGCTTTG GATTTTCGAC AAGTTTCTGG GCTTGATAGA GCGCCATGCC
TGA
 
Protein sequence
MHIDVSGANN RGYLVLEDGT VFVGRLIGAE KTAIGEVVFT TAVVGYPQIL TDPSYKGQII 
VFTKPLVGNY GVSEDQMESD GVKAEGVVLF EATKPSHYKS VMTLEEWLAA SGIPGVARVD
TRALVLKLRE HGVMMGAVGP EEPGALLEAL RKSPRYEELS YVDLVSVQEP VELGEGRLCV
GIVDCGVKKS IVKEFLKRGV RVRLVPCRRP EAAFDCDGLF LSNGPGNPQL LDSLASKVAQ
YAEYKKPLMG ICLGHQVIAM AFGARIYKLK FGHRASNKPV RDLRFTGKTY ITTHNHGYAV
DPEGTELKVW AVQPDDGTVE GLYHERQPVF TTQFHPEASP GPRDTLWIFD KFLGLIERHA