Gene Pars_1014 details


Gene Information

Locus tag: Pars_1014
Symbol:
ID: 5054751
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 905698
End bp: 907080
Gene Length: 1383 bp
Protein Length: 460 aa
Translation table: 11
GC content: 55%
IMG OID: 640468572
Product: phosphoenolpyruvate carboxylase
Protein accession: YP_001153246
Protein GI: 145591244
COG category: [S] Function unknown
COG ID: [COG1892] Uncharacterized protein conserved in archaea
TIGRFAM ID: [TIGR02751] phosphoenolpyruvate carboxylase, archaeal type
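
The start, end, gene length, and protein length values in this record are consistent with one another. The following is a minimal Python sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon. Neither assumption is stated by the record, and the helper names `gene_length` and `protein_length` are hypothetical.

```python
# Coordinates copied from the record above.
start_bp = 905698
end_bp = 907080


def gene_length(start, end):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp):
    """Number of amino acids, assuming the CDS ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)
print(length_bp, "bp;", length_aa, "aa")
```

Under these assumptions the computed values match the Gene Length (1383 bp) and Protein Length (460 aa) fields.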


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.189328
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value: 0.549107
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGATGCATA TACCGCGTTT GATGTGTACA CAGCATCCTG ACACCACCGT TAAGATCACC 
ACTGCAGAGG AGGTTGACGA GGCCATAGTG GCGTACACCG CGTATGGGTG CGATGAGGTG
ATGGTAGACT ACGAGGGAAA GATGACGCCA TATGGACAGC CCAAGGAGAT TGTAATGAAG
GCGATCAGGG GCGATGTGCC TCTTGGCGAT GAGTTCTACA TCACTGTGAG GCTTCCTAAT
CCCAAACTGG AGGAGTTTGA TAGAGCGATG CTTTCGCTAG AGGCGGCTCT CGTGGCGAAC
TACTTCTCTA GGAGGTACGC CGACGCGCAG GCGGTGAGGT GGGTGGTGTT GCCCATGGTG
GAGGATTTTG ATACTGTGAT ACTGGTGAGG AGGATGCTGA GGCGCAAGGC CGAGATCTAC
AAGTCGGAGA CAGGTGTTGA CGTGGGGGAG GTGGAGGTGA TCCCACTTAT AGAAGACGCC
TTTGTGCAGG TGAAGGCTAA GGTGATTGTC GGCGAGGTCT TCAAGAGCGA GGAGGCAAGG
GAGGTGAGGG TCTTCTTAGG CAAGTCCGAC TCGGCGGTGA GGCACGGCCA CCTCGCGTCG
GCGTTGGCTA TAATCTACGC TATGTCGAAA CTGAAGGAGT TCGAGGCAGA GTCAGGTATA
AGGGTGCGGC CCATCTTAGG CATGGGATCA CCGCCTTTTA GAGGCGCCTT AAACAACCCG
AGGCTGGCCC ACTTGGAGGT GGTGCAGTAC GCGGGCTACT ACACCGCCAC GATACAGTCA
GCAGTTAGGT ACGACACGTC GCTAGACGAG TACGTAAAGG TCAGGGAGTC TATCCTCAAC
GCTTGTTGCG GCACGAGGGG CCTGGTGGGG GAGGAGGTGT TGCCGCTCAT ACAGGAAGCC
TCCGCTAAGT ACAGGTCACA AGCTATGAAG CACGTGGATA AAATAGCGGA GGTGGCCCGT
CTCGTGCCAA GCACTAGGGA TAGGGTTAGC TGGAAGGAGT ACGGCAGATC CCTACTTGAC
GGAGACCGAG TAGTGCATAT GCCCCGTGCC ATTGTATACA CGTCGGCGTG GTATGCCATG
GGATTCCCGC CGACGCTCAT TGATGCGCCG TTACTCTTGG AGCTGGCTAA GTCGGATAAG
CTAGATGCGG TGTTTAAACT CCTTCCTACC TACAAGATGG AGCTGGAATA CGACTACGAG
TTTTTCGACC CACAGACAGC GAGGAACTAC CTAAGTGAGG AGCTCGTATA CGCCGCGGTG
GAACTCGCCG ACTATCTCGG CGTAGAGGCA AGGCCCACCC CCACTTACAC CGCGTTGTTG
AAGATGCCTA GAAGCGAGCC TAACATAATT GCACTGAGCA AATATCGAAA ATTCCTGGGA
TAA
 
Protein sequence
MMHIPRLMCT QHPDTTVKIT TAEEVDEAIV AYTAYGCDEV MVDYEGKMTP YGQPKEIVMK 
AIRGDVPLGD EFYITVRLPN PKLEEFDRAM LSLEAALVAN YFSRRYADAQ AVRWVVLPMV
EDFDTVILVR RMLRRKAEIY KSETGVDVGE VEVIPLIEDA FVQVKAKVIV GEVFKSEEAR
EVRVFLGKSD SAVRHGHLAS ALAIIYAMSK LKEFEAESGI RVRPILGMGS PPFRGALNNP
RLAHLEVVQY AGYYTATIQS AVRYDTSLDE YVKVRESILN ACCGTRGLVG EEVLPLIQEA
SAKYRSQAMK HVDKIAEVAR LVPSTRDRVS WKEYGRSLLD GDRVVHMPRA IVYTSAWYAM
GFPPTLIDAP LLLELAKSDK LDAVFKLLPT YKMELEYDYE FFDPQTARNY LSEELVYAAV
ELADYLGVEA RPTPTYTALL KMPRSEPNII ALSKYRKFLG
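
The sequences above are printed in blocks of ten characters separated by spaces, so they need to be joined before any analysis. The following is a minimal Python sketch of one way to do that and to compute a GC percentage. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the example uses only the first and last lines of the gene sequence rather than the full record.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(dna):
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# First and last lines of the gene sequence shown above (not the full gene).
example = """GTGATGCATA TACCGCGTTT GATGTGTACA CAGCATCCTG ACACCACCGT TAAGATCACC
TAA"""

dna = clean_sequence(example)
print(len(dna), dna[:3], dna[-3:])
print(round(gc_percent(dna), 1))
```

Pasting the full gene sequence into `example` would allow the result to be compared with the GC content field of the record.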