Gene Pars_2065 details

Gene Information

Locus tag: Pars_2065
Symbol:
ID: 5054797
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 1847000
End bp: 1848283
Gene Length: 1284 bp
Protein Length: 427 aa
Translation table: 11
GC content: 52%
IMG OID: 640469614
Product: Glu/Leu/Phe/Val dehydrogenase, C terminal
Protein accession: YP_001154263
Protein GI: 145592261
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0334] Glutamate dehydrogenase/leucine dehydrogenase
TIGRFAM ID:
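
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not say so, but the numbers agree with that reading) and that the CDS ends in a single stop codon. The function and variable names are illustrative only.

```python
# Values copied from the record; names are illustrative, not from the database.
start_bp = 1847000
end_bp = 1848283
gene_length_bp = 1284
protein_length_aa = 427


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate interval (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp) == gene_length_bp)            # coordinates vs. gene length
print(expected_protein_length(gene_length_bp) == protein_length_aa)  # gene length vs. protein length
```

Both checks hold for this record: 1848283 - 1847000 + 1 = 1284 bp, and 1284 / 3 - 1 = 427 aa.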


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.86041
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value: 0.202815
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAATTCG GCCTCCACAT AATGGCAAGC GAGCACGGCT TCCTCACGCA CGTACTGGGA 
AACCTGAGGA GGGGGGTTGA GCTCGGCGGA TTTCCAGAAG ACTTCTACAA GGTAATATCA
AGGCCAAAGA GAGTGTTGCA AGTCTCAATA CCAGTAAAAA TGGACAACGG CCAGATTGAG
GTCTTCGAGG GTTATCGTGT TCAGCATTGT GATGCTTTGG GGCCTTTTAA GGGTGGTATC
CGTTTTCATC CGGAGGTTAC TCTTGCTGAT GATATTGCTC TTGCCATGTT GATGACGCTT
AAGAATAGCC TCGCCGGCCT CCCATACGGC GGCGCTAAAG GCGCCGTCCG CGTCGACCCA
AAAAAACTAT CGGCAAGAGA GCTTGAAGAG CTCTCCAGAG GCTACGCCAG AGCCATTGCG
CCTTTAATAG GCGACGTCGT GGACATACCA GCCCCAGACG TAGGCACCAA CGCCCAGATA
ATGGCGTGGA TGACAGACGA ATACTCCAAA ATAAAAGGCC ACAACACCCC CGGCGTATTC
ACCTCCAAAC CACCAGAACT CTGGGGAAAC CCAGTAAGAG AATACGCCAC CGGCCTCGGA
GTAGCAGTAA CCACAAGAGA AATGGCCAAA AGACTCTGGG GAGAAATAGA AGGAAAAACC
GTGGCGATAC ACGGAGCTGG GAACACCGGG GCGTGGGCCG CCTACTGGCT TGGAAGAATG
GGCGCCAAGA TAGTGGCTAT ATCAGATTCC AAAGGCTCTG TAATAAACGC CAAGGGGATC
CCCGCTGAGG ATATCTTAGG AGTTTACAAG GAGAAGTCCG TAAACCCCCA GGTCTCCGTC
ACTATGCTTG AGGGCAACAA GGGGTCTCCA GATGCCCCGT TGTATCAAGA CGTTGATGTT
CTTATTCCTG CTACTATTGA GAATGTGATT CGGGGGGATA ATGTCGGTTT GGTTAAGGCT
AGGCTGGTGG TGGAGGGTGC TAATGGGCCT ACTACTCCGG AGGCTGAGAG GGAGCTTTAC
AAGAGGGGTG TGGTGGTGGT GCCCGACATC TTGGCCAACG CCGGCGGCGT CGTCATGTCG
TACTTGGAGT GGGTGGAAAA CCTCCAGTGG TATTTCTGGG ATGAGGAGGA GACTAGAAAA
AGACTAGAAG CCATAATGGT AAACAACGTG GCGAAGGTAT ACCACCGGTG GCAAAAAGAA
AAAGAATGGA CCATGAGAGA CGCCGCCATA GTCACAGCCC TAGAAAGAAT ATACAAAGCA
ATGAAAACAA GAGGATGGAT CTAA
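
The gene sequence is printed in blocks of ten bases separated by spaces, so it needs to be cleaned before any length or GC calculation. The following is a minimal sketch of that step, assuming the blocks are pasted in as plain text; `clean` and `gc_fraction` are hypothetical helper names, and only the first three blocks of the sequence are used here to keep the example short.

```python
def clean(seq: str) -> str:
    """Remove all whitespace (block separators, line breaks) and uppercase."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


# First three blocks of the gene sequence, copied from the record.
first_blocks = "ATGAAATTCG GCCTCCACAT AATGGCAAGC"

print(len(clean(first_blocks)))
print(round(gc_fraction(first_blocks), 3))
```

Applying the same two functions to the full sequence would give values to compare against the Gene Length and GC content fields.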
 
Protein sequence
MKFGLHIMAS EHGFLTHVLG NLRRGVELGG FPEDFYKVIS RPKRVLQVSI PVKMDNGQIE 
VFEGYRVQHC DALGPFKGGI RFHPEVTLAD DIALAMLMTL KNSLAGLPYG GAKGAVRVDP
KKLSARELEE LSRGYARAIA PLIGDVVDIP APDVGTNAQI MAWMTDEYSK IKGHNTPGVF
TSKPPELWGN PVREYATGLG VAVTTREMAK RLWGEIEGKT VAIHGAGNTG AWAAYWLGRM
GAKIVAISDS KGSVINAKGI PAEDILGVYK EKSVNPQVSV TMLEGNKGSP DAPLYQDVDV
LIPATIENVI RGDNVGLVKA RLVVEGANGP TTPEAERELY KRGVVVVPDI LANAGGVVMS
YLEWVENLQW YFWDEEETRK RLEAIMVNNV AKVYHRWQKE KEWTMRDAAI VTALERIYKA
MKTRGWI
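
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this in plain Python, assuming the standard amino-acid assignments (translation table 11 assigns codons to amino acids the same way as the standard code and differs only in which start codons are permitted). The `translate` helper is hypothetical and only handles unambiguous A/C/G/T input.

```python
from itertools import product

# NCBI codon ordering: bases in the order T, C, A, G, third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a nucleotide sequence in frame 1, stopping at the first stop codon."""
    s = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks and last row of the gene sequence, copied from the record.
print(translate("ATGAAATTCG GCCTCCACAT AATGGCAAGC"))  # MKFGLHIMAS
print(translate("ATGAAAACAA GAGGATGGAT CTAA"))        # MKTRGWI
```

The two outputs match the first block and the final row of the protein sequence above.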