Gene Pars_1870 details


Gene Information

Locus tag: Pars_1870
Symbol:
ID: 5055843
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 1673557
End bp: 1674726
Gene Length: 1170 bp
Protein Length: 389 aa
Translation table: 11
GC content: 57%
IMG OID: 640469416
Product: Serine--glyoxylate transaminase
Protein accession: YP_001154073
Protein GI: 145592071
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0075] Serine-pyruvate aminotransferase/archaeal aspartate aminotransferase
TIGRFAM ID:
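
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The variable names are made up for this example.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1673557
end_bp = 1674726
gene_length_bp = 1170
protein_length_aa = 389

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# A CDS of n bases has n/3 codons; the final codon is assumed to be the stop.
codons = span_bp // 3
expected_protein_aa = codons - 1

print(span_bp, span_bp == gene_length_bp)                          # 1170 True
print(expected_protein_aa, expected_protein_aa == protein_length_aa)  # 389 True
```

Under these assumptions the reported gene length (1170 bp) and protein length (389 aa) agree with the coordinates.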


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.856595
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTCGTAA GTAGCGTGTA TCGTAGATTT GCGCAGAAGA GAGTCCTCAC TCCGGGGCCT 
ACAGAGTTGC CGCCGTGGGT TAGGGCCGCT TTGGCGAGGG AGACAACTAA CCCAGATTTA
GATCCTGGGT TTTTGCGGGA GTATGAGGAG GTGGTAGAAA TGTTGAGGGC ACTTGTCGGT
GCTTGGCAGT CTCGGGTGTA TGTGTGGGCT GGGGAGGCGA TGCTAGGTCT TGAGGCCGCC
GTTGCGAACG CTGTGAGGCC TGGATCTAAG GTTTTGGTTG TAGACAACGG CGTGTACGGC
GCTGGATTTG CCGACTTGGT GAAGATGTAC GGCGGGGAGC CGGTGTTGCT TGGACTTGAC
TGGAGAAGTG CCGCCGATCC GGCGGCGGTG GATAGGGCGC TTGAGAGGGA GAGAGATGTG
GAGGTAGTTA CGCTGGTTCA TTGCGATACG CCGACGGGTG TGTACAATGG CTTGGAGGAA
ATTGCCAAGG TAGTGTCGGC CCATGGCGCG TTTCTAATAG TCGACGCAGT CTCCTCAGTG
GGTGCTGATG TGATCGACGT AGACAGATGG GGTATAGGCG CGTTAATCGG CGGCTCGCAG
AAAGCTCTAA ATGCGCCGCC TGGACTCACT ATAATGGCCG TGAGTAAAAG GGCGCTTGAG
AGAGAGGCTG AGGTGGGGCG TAGGTCGTAC TACATGAGCT ACCGGGTGTG GGAGGAGTGG
TTGGAGAAGG AGGGCTTCCC CTACACAATG CCAGATTTGT TGATATACGC ATTGAAGGAG
AGTTTGAAGA AAATACAAGA AGAGGGCTTA CACTCCGTTG TCGCTAGACA CAAAGCCGCT
AGGGCCGCGG CAAGGAGGGG TGTGGAGGCC CTAGGGCTAG AGCCTTTCGC TAGGCGTGTG
GAGTGGAACT GCCCAACGGC CACAGCCTTC AAGACTCCGA TCCCGGCGCC GGAGTTCAGG
AGGCATATTT GGGAAAAGTA CGGCATAATG CTGGCAGGAA GCTGGGGCCC AGTGGAGAGG
GAGGTTATGA GAATTGGCCA CATGGGGGTA CAAGCCTCGG CTGATCACCT GGCGGTAGCG
ATATCGGTGC TGGGAGCCGC GCTACGGGAC TACGGATTCA ACGTACCAGT GGGGAAGGCC
GTAGAGGAGG CGCTGGAGGC GTTTAGGTAG
 
Protein sequence
MVVSSVYRRF AQKRVLTPGP TELPPWVRAA LARETTNPDL DPGFLREYEE VVEMLRALVG 
AWQSRVYVWA GEAMLGLEAA VANAVRPGSK VLVVDNGVYG AGFADLVKMY GGEPVLLGLD
WRSAADPAAV DRALERERDV EVVTLVHCDT PTGVYNGLEE IAKVVSAHGA FLIVDAVSSV
GADVIDVDRW GIGALIGGSQ KALNAPPGLT IMAVSKRALE REAEVGRRSY YMSYRVWEEW
LEKEGFPYTM PDLLIYALKE SLKKIQEEGL HSVVARHKAA RAAARRGVEA LGLEPFARRV
EWNCPTATAF KTPIPAPEFR RHIWEKYGIM LAGSWGPVER EVMRIGHMGV QASADHLAVA
ISVLGAALRD YGFNVPVGKA VEEALEAFR
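
The GC content field of this record can be recomputed from the gene sequence. The following is a minimal sketch, not the method the database uses: `gc_percent` is a hypothetical helper that strips the whitespace separating the 10-base groups and counts G and C bases. The demonstration uses only the first line of the gene sequence; the full 1170 bp sequence would need to be pasted in to compare against the reported value.

```python
def gc_percent(sequence: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)

# First line of the gene sequence in this record, used only as a short
# demonstration; it is not the whole gene.
fragment = "ATGGTCGTAA GTAGCGTGTA TCGTAGATTT GCGCAGAAGA GAGTCCTCAC TCCGGGGCCT"
n_bases = len("".join(fragment.split()))
print(f"{gc_percent(fragment):.1f}% GC over {n_bases} bases")
```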