Gene Pars_1969 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_1969 
Symbol 
ID5054536 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp1765092 
End bp1766051 
Gene Length960 bp 
Protein Length319 aa 
Translation table11 
GC content50% 
IMG OID640469516 
Productglycosyl transferase family protein 
Protein accessionYP_001154168 
Protein GI145592166 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0472] UDP-N-acetylmuramyl pentapeptide phosphotransferase/UDP-N-acetylglucosamine-1-phosphate transferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCGCCG AAATCGTGTC TGCCGTCGCC GCTTTTATAG CCGGTGTTGT TTTTGGCTTG 
TGGTGGGTAG GCGAGCAGAA AAGGCGCAAC ATCACATCCC GTGATATATA CAAGAATATT
AGTGGAGTGC CTAGAGCCGG GGGGCTAATA GCAATGGTGG CTGCAACTGT GGGGTATAGC
CTCTTGTCAA CAATTACGGA TAAGTCGTTG CTAGTTCTGG TGATATCGAT GATTATGGGG
ATCTTGGGGC TAGTTGACGA CTTGAAAGGG CTTAGCGAAT ACGTAAGGGT GCTAGTCCCG
GTGGTCTTAG CATTTGCGCT AGCCCGGACA AGTATGATAA CGCTTACTGT GCCGATGGTA
GGTCTTTTCT ATGGGGCAAC TGGTTGGCTC TCTGTCTTGG CAATTCCCGT ATTGACAAAT
GCCTTTAACA TGCTTGACCC GGTAAACGGC TTTCTTCCCA TGGCAAACAC CATAGTTGGC
CTCTCCCTAG CCGCGGTAGC CGCTATGAGG GGACAGTGGG ACGCCGTCTA TTTGTTGGCG
GTTCATGCGG CGGCTTCCCT TTCGCTGTAT GTGCACAACA GATACCCCGC CAAGACCTTC
AACGGTAATG TCGGTAGCTA CTTCTTGGGA GCTAGCATCT CTACAATAGC AGTACTCTAC
GACTTAGTCC CTTATCTGAT ACTAGCTGGT CTTCCCTTTG TTGTAAACGG GGCGTTGATA
ATATTCTCCT CAGGCGGGAT TAAGGGACGG GAAAAAATTG AGAGGCCTAC CTATTTGGAA
AACGGCCTCG TGTACCAACA ATGCAACTCA CCTATTATTT CCCTGGTTAG GTTAACTGTA
GCCAACGGGC CTATGAACGA GTACGGAATT TTCAAGGCGC TGACGGTGCT GACGGCGACG
ACCTCGGCAT TAACAGTAGC AACCACAGCA GTCATACACA TCTTTTCTTT ACCCATATGA
 
Protein sequence
MFAEIVSAVA AFIAGVVFGL WWVGEQKRRN ITSRDIYKNI SGVPRAGGLI AMVAATVGYS 
LLSTITDKSL LVLVISMIMG ILGLVDDLKG LSEYVRVLVP VVLAFALART SMITLTVPMV
GLFYGATGWL SVLAIPVLTN AFNMLDPVNG FLPMANTIVG LSLAAVAAMR GQWDAVYLLA
VHAAASLSLY VHNRYPAKTF NGNVGSYFLG ASISTIAVLY DLVPYLILAG LPFVVNGALI
IFSSGGIKGR EKIERPTYLE NGLVYQQCNS PIISLVRLTV ANGPMNEYGI FKALTVLTAT
TSALTVATTA VIHIFSLPI