Gene Pars_1777 details

Gene Information

Locus tag: Pars_1777
Symbol:
ID: 5055522
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 1597678
End bp: 1598769
Gene Length: 1092 bp
Protein Length: 363 aa
Translation table: 11
GC content: 53%
IMG OID: 640469322
Product: nucleotidyl transferase
Protein accession: YP_001153980
Protein GI: 145591978
COG category: [J] Translation, ribosomal structure and biogenesis; [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1208] Nucleoside-diphosphate-sugar pyrophosphorylase involved in lipopolysaccharide biosynthesis/translation initiation factor 2B, gamma/epsilon subunits (eIF-2Bgamma/eIF-2Bepsilon)
TIGRFAM ID:
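
The start, end, gene length and protein length fields above are mutually consistent if the coordinates are 1-based and inclusive and the last codon of the CDS is a stop codon. Both conditions are assumptions made for this check, not statements from the record. A minimal sketch of the arithmetic, with hypothetical helper names:

```python
# Values copied from the Gene Information fields above.
START_BP = 1597678
END_BP = 1598769


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints: 1092 363
```

The printed values agree with the Gene Length (1092 bp) and Protein Length (363 aa) fields.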


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.13003
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.0200486
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAATAG CCCCTGTCGT ACTAGCGGGC GGCACGGCCG GCATATTTGA AAAATTGACA 
GGTCAACTCC CCAAGACTTA CATAAAAATC GGCGGGAAAA GACTATACCA ATACACCGCC
GACGCGTTAC TGGCTATATT CGGAAAAGTC TACGTAGCGG CGCCCCGCCC CGAAGATAAC
CCCTACATCT ATGTAGAAGA AAGAGGGACT GGAGTCGAGA GGGCAATCTC CGCGGCAGAG
GCCTACTTAG GCGCTGAGAC CCATATGCTG ATAGCATATG GCGACGTTTA TGTAGAGAGC
TATGCGTACA GGTCTCTAAT AGAGGCGACA GCTACCACAG GCGCAGACGG CGCTGTGCTC
GCCGTGCCGA GAAAAGCCAC AAAGGGATAC GGCGTACTGG AGACGAAAGC CGGGACCCTC
CTAGCCAAAA TAGGGGGAGA AGGCCAGTGG ATATTCGGAG GATTAGCCCT CCTCCCACGT
GCCGCACTAA GGATAATAGA ACAAGCCGGG CTTTACGAAG GCTTAAACCA AATAGCACAG
CGATCAAAAA TAGCGGTTGT ACCGTGGAGC GGGACGTGGC ATGATGTAAA TCACCCAGAA
GACTTAATGC AACTGCTAGA GTACACAGCG CCGAGGAATA CCATTATTGC TAAAACCGCC
AAGGTAAGCC CCACTGCCGT GTTGGAGGGA CCCGTCGTAA TTGAGGATGG CGTAGAGATA
GACCACTACG CCGTGATAAA AGGTCCTGCC TACATAGGGA AGGGGGCTTT TATAGGAGCC
CACGCACTTA TACGTAACTA CACCGATATA GAGGAGGGCG CCGTCATCGG AAGTAGCACA
GAAGTAAGCC ACAGCCTCAT CTGCGAGAGG GCCACTGTGG GGAGGGGGTC CTTTGTCTCC
TACAGCGTAG TAGGCGAAGA GGCAGTTCTA GAACCCAACA TTGTGACTAT GTCGGTACTC
AGAGAGGGGC GCGATAGACT AGAACCAATA CAAGTAAGAG GCCAGGTATA CTACAAACTA
GGCGCCTTAA TACCGCGGAA AGCCCGAGTA TCCGCAGGCA CAACACTACC TCCAGGAGCT
GGCTGGGACT AA
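
The GC content field can be recomputed from the gene sequence. The sketch below is one way to do it, assuming the sequence is pasted in with its spaces and line breaks, and assuming that any character other than A, C, G or T is ignored. The function name is hypothetical. The reported 53% refers to the full gene, so a single line of the listing need not give the same value.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters in seq.

    Whitespace and any other characters are skipped (an assumption of
    this sketch, not a rule taken from the record).
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    gc_count = sum(1 for b in bases if b in "GC")
    return 100.0 * gc_count / len(bases)


# First line of the gene sequence listing, used here only as sample input.
first_line = "ATGAAAATAG CCCCTGTCGT ACTAGCGGGC GGCACGGCCG GCATATTTGA AAAATTGACA"
print(round(gc_percent(first_line), 1))
```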
 
Protein sequence
MKIAPVVLAG GTAGIFEKLT GQLPKTYIKI GGKRLYQYTA DALLAIFGKV YVAAPRPEDN 
PYIYVEERGT GVERAISAAE AYLGAETHML IAYGDVYVES YAYRSLIEAT ATTGADGAVL
AVPRKATKGY GVLETKAGTL LAKIGGEGQW IFGGLALLPR AALRIIEQAG LYEGLNQIAQ
RSKIAVVPWS GTWHDVNHPE DLMQLLEYTA PRNTIIAKTA KVSPTAVLEG PVVIEDGVEI
DHYAVIKGPA YIGKGAFIGA HALIRNYTDI EEGAVIGSST EVSHSLICER ATVGRGSFVS
YSVVGEEAVL EPNIVTMSVL REGRDRLEPI QVRGQVYYKL GALIPRKARV SAGTTLPPGA
GWD
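
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table from the compact base-order/amino-acid string used for the standard genetic code. Translation table 11 assigns the same amino acids to sense codons and differs in which codons may act as start codons, which does not matter here because this gene begins with ATG. The function and constant names are hypothetical.

```python
from itertools import product

# Codons enumerated in TCAG order at each position, paired with the
# standard amino-acid string ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace is removed first; a codon containing anything other than
    A, C, G or T raises KeyError.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 12 bases of the gene sequence listed above.
print(translate("ATGAAAATAG CCCCTGTCGT ACTAGCGGGC"))  # prints: MKIAPVVLAG
print(translate("GGCTGGGACT AA"))                      # prints: GWD
```

Both fragments match the start (MKIAPVVLAG) and end (GWD) of the protein sequence above.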