Gene Pars_1827 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_1827 
Symbol 
ID5056177 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp1637906 
End bp1638934 
Gene Length1029 bp 
Protein Length342 aa 
Translation table11 
GC content50% 
IMG OID640469373 
Product3-dehydroquinate synthase 
Protein accessionYP_001154030 
Protein GI145592028 
COG category[C] Energy production and conversion 
COG ID[COG0371] Glycerol dehydrogenase and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0648145 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value0.20433 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAGCAAC TTGAGAGTTT TGAGATCCCG AGAACAGTCA TCTTTGGGCC AGGCGCAATT 
TCGAAAACCC CTCAAGTAGT TGCCAAGCAC AAGGCGGAGA GAATCCTAAT AATATCAGGT
AAATCTGTTA CTGCCAACTA CGCCAATGAG GTCGCACATT TGCTATCAGG TTACAGCGTA
GACGTGGTAA GATACGACGA GGTAGATACA AGCTATTCGA AATACGACTT AGTGTTGGGC
GTCGGGGGCG GGAGGCCTAT TGACGTGGCC AAAGTGTACT CATATCTGCA TAGGGCTCCT
CTAATAGTTA TCCCCACTTC GGCCAGCCAC GACGGAATTG CCTCGCCATA CGTGTCGTAT
GCCCTATCCC AGAAAATGGC CTCGCATGGG AAAATAGTGG CATCTCCCAT AGCGATAATA
GCTGACACCA CCGTAATCCT CAACGCGCCT TCTCGGTTGT TGAAAGCAGG AATAGGAGAC
CTCCTTGGAA AAATAGTTGC TGTACGTGAT TGGCAACTTG CCCATAGGCT AAAAGGCGAG
GAGTACAGCG AATACGCCGC CCACCTGGCG CTCACCAGCT ATAGAATAGT GGTTTCTAAC
GCTTTCAGAA TCAAGAACTT TACTAAGGAG GAAGATGTGA GAGTTTTAGT AAAGGCCCTT
ATAGGATGCG GCGTAGCTAT GGGCATTGCA GGTTCATCGC GGCCGTGTAG TGGCTCTGAA
CACCTCTTTG CCCACGCCGT CGAGTTACTG CTAGGGGAGA AGAACAACGA GGCCATACAC
GGCGAGTTAG TAGCCCTAGG CACTGTGGTA ATGGCCTACC TACATGGCAT GAACTGGCGC
CGGATAAAAA GAGTAGCAAA AGAGGTGGGG CTTCCAACTA CTTTGAAACA GATAGGTATA
GACGCAGATG TGGCTATAGA GGCCTTAACA ACAGCACACA CCCTCCGCCC AGATCGCTAC
ACAATTTTAG GGAGTGGACT AGGGAAAGAG GCAGCCAGAC GCGCCTTGGA AACTACAGAA
TTAATATAA
 
Protein sequence
MKQLESFEIP RTVIFGPGAI SKTPQVVAKH KAERILIISG KSVTANYANE VAHLLSGYSV 
DVVRYDEVDT SYSKYDLVLG VGGGRPIDVA KVYSYLHRAP LIVIPTSASH DGIASPYVSY
ALSQKMASHG KIVASPIAII ADTTVILNAP SRLLKAGIGD LLGKIVAVRD WQLAHRLKGE
EYSEYAAHLA LTSYRIVVSN AFRIKNFTKE EDVRVLVKAL IGCGVAMGIA GSSRPCSGSE
HLFAHAVELL LGEKNNEAIH GELVALGTVV MAYLHGMNWR RIKRVAKEVG LPTTLKQIGI
DADVAIEALT TAHTLRPDRY TILGSGLGKE AARRALETTE LI