Gene Pars_1035 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_1035 
Symbol 
ID5054600 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp922253 
End bp923380 
Gene Length1128 bp 
Protein Length375 aa 
Translation table11 
GC content62% 
IMG OID640468591 
Product2-methylcitrate synthase/citrate synthase II 
Protein accessionYP_001153265 
Protein GI145591263 
COG category[C] Energy production and conversion 
COG ID[COG0372] Citrate synthase 
TIGRFAM ID[TIGR01800] 2-methylcitrate synthase/citrate synthase II 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTACCTCC CTGGTCTAGA AGGGGTTGTG GTAAAAGAAA CGAAGATATG CTACATCGAC 
TTGGAAAATT CCAAGATATA CTACCGCGGC TACGACCTCG AAGAGCTGGC CCGGCTGTCC
ACTTTTGAGG AGGTGACCTA CCTCCTTTGG TTCGGCCGGC TCCCGGGCAG GCGAGAGCTG
GAGGAGTTTA AGGCTAGGCT CGCGGCGCAT CGGCTACCCC TGCCGCACGT CGCGGCGCTG
GCGAAATCAG CGCCGCCATC GGCTGAGCCT ATCGACGTGT TGAGGACGGC GGTTTCGGCA
ATGGCTTGGG GGGAGGATCT TTCAGACAAG TCGCCGGAGG CGGAGCTCCA GAGGGGGTTG
AAGATAACCG CGGCGATGCC CTACGTCGTG GCGGCTTTTG ACAGGGCTAG AAGGGGGCAA
GAGCCTGTCC ACCCGGCGGA GGCGGGGAGC CACGCGGAGT ACTTCCTCTG GGCGCTTAGG
GGGGAGAGGC CCAGCCCGCG GGAGGCCAGG GCGATGGACG TCATGCTGAT AGTATACGCA
GAGCACTCCA TGAACAACAG CGCCTTCACC GCAGTTACCG TGGCCTCAAC CTTCGCCGAC
ATGTACGCTG CCGTCACCGC GGCCGTGGCC AGCCTCAAGG GGCCTCTCCA CGGCGGGGCC
AATGTAGACG CCGCGAAGAT GATCGAGGAG ATAGGAGACG CCAAGAAGGT GGAGCGCTGG
GTCGATGAGC AACTGGCCAA GGGGCGGAGG ATACCGGGCT TCGGACACCG GCTGTACAAG
AAGGGCCCCG ACCCGAGGCT GAGGGTTCTG AGGGAGCTGG CTAAAGGGCT AGCGGCGGAG
AGGGGTGACT TCCGCTGGGT GGAAATCGCC GAGCGGCTCG AAGATTACGT GACGGCTAAG
CTGGCGGCGA AGGGCATCTA CCCCAACACC GACCTATACG CCGCGGTGAT CTTCCGCTAC
CTCGGCCTAC CCGTTGACAT AAACCTGCCG ACCTTCGCCA TATCCCGCGC GGCTGGATGG
GTCGCCCACG TCTTGGAATA CCGCCAAGCG AATCGCCTCA TAAGGCCGAC AGAGAAATAC
GTCGGCCCCA TTGGGCTTAA GTACATCCCA CTGGAGGAGC GGAGCTAG
 
Protein sequence
MYLPGLEGVV VKETKICYID LENSKIYYRG YDLEELARLS TFEEVTYLLW FGRLPGRREL 
EEFKARLAAH RLPLPHVAAL AKSAPPSAEP IDVLRTAVSA MAWGEDLSDK SPEAELQRGL
KITAAMPYVV AAFDRARRGQ EPVHPAEAGS HAEYFLWALR GERPSPREAR AMDVMLIVYA
EHSMNNSAFT AVTVASTFAD MYAAVTAAVA SLKGPLHGGA NVDAAKMIEE IGDAKKVERW
VDEQLAKGRR IPGFGHRLYK KGPDPRLRVL RELAKGLAAE RGDFRWVEIA ERLEDYVTAK
LAAKGIYPNT DLYAAVIFRY LGLPVDINLP TFAISRAAGW VAHVLEYRQA NRLIRPTEKY
VGPIGLKYIP LEERS