Gene Pars_1899 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_1899 
Symbol 
ID5055396 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp1706160 
End bp1707332 
Gene Length1173 bp 
Protein Length390 aa 
Translation table11 
GC content53% 
IMG OID640469448 
Productpullulanase 
Protein accessionYP_001154102 
Protein GI145592100 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG4945] Membrane-anchored protein predicted to be involved in regulation of amylopullulanase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGTCAA AAGAAGTACT CCTTGCAGTT ATGTTAGCCG CTTTGATATA TGCACAGGGG 
GTATTTACTG TACAAACGGC CACTGACCCA ACAGGTGATT TCAAAGGACC TGGATGGTTT
GTCCCGCCTC AGAATCCCGT TTTTAAAAAC GGGACTGTAT TTGATCTCAC AAAGTTTGAA
GTCCTTTATA ATGCCACGGC AGACGCACTA GTCTTTAGAC TAACCTTCGC TGACCTCGGC
GGCAACCCGT GGGGCTCCGA GACAGGCTTC TCGTTGCAGT ATGTGCAGAT ATACATAAGC
AGAGGCTTCC CTGGCAACCC GTGGGGGACA GTATCGTGCA CGATCCTAAG ACCTGACGAC
GGCGATGTGG CCTCGGGCAA CGCCTTTTTT GACGAGGCCA CGAGATTCTT CTGCCCCGAT
CCCGCCAACT TGACGCAGTT TAAATACACG CCGGGGGTGA AGTTCTCAAG CCAAGCCCCG
TGGGACGTCG CGATTTTCAT AGGCCCCAAG TGGGGCAACG AGACTGTTAA CTTCGTCGCA
GTTGCAGATG TGACGGGTGG CACCATAAGC GTCTCGCCAC TCCCGCGCGT CTACGCACAG
GGCAACGCCA TAGTGGCAGT TGTGCCCAGG AAGCTAATAC CGCCAACCAC GAGGCTAATG
AGCGATTTCC CACAACCAAG CTGGAGGTAC TACGTGTTGG TCACCTCTTA CGACGGCAAC
GGTCCCGGCC GCATTAGACC CTTCGGACCT ATGGCCCAGG AGTGGACAGT GGGCGTAGGT
ACCGCTAACG CCTCTTCTGT TTTATCAGGA ACTATTCCTA GAGTGCTCGA TGTACTAGGT
CCTAACACTC CGTTGAGAAC TTTCACTAAG GATGAGCCAG CAACGCTGGA GCCCCAGACG
CCGAGCTGGG GCAACTTCCC GCTAGCCTAC ACAACCACCA CCGTTAATAA AATAGTGCCG
CTCACAGTAA CCAAGACGGA CACATTAACA CTTACGGAAA CCGCATACAT AACGTTGACC
ACCACAAGAG TGGAAACATT AACCAGAGTA GAGACTTTTA CCCAGGTCAA CGTGGTGGAG
AAGCCTTACG TCGATCCGGT AAGTTACGTC GTGTTGGGTA TCGGCGTAAT CGCCGGTATT
GTGGGGGCGC TTGCCGCGGC GAGGAGAAAA TAA
 
Protein sequence
MKSKEVLLAV MLAALIYAQG VFTVQTATDP TGDFKGPGWF VPPQNPVFKN GTVFDLTKFE 
VLYNATADAL VFRLTFADLG GNPWGSETGF SLQYVQIYIS RGFPGNPWGT VSCTILRPDD
GDVASGNAFF DEATRFFCPD PANLTQFKYT PGVKFSSQAP WDVAIFIGPK WGNETVNFVA
VADVTGGTIS VSPLPRVYAQ GNAIVAVVPR KLIPPTTRLM SDFPQPSWRY YVLVTSYDGN
GPGRIRPFGP MAQEWTVGVG TANASSVLSG TIPRVLDVLG PNTPLRTFTK DEPATLEPQT
PSWGNFPLAY TTTTVNKIVP LTVTKTDTLT LTETAYITLT TTRVETLTRV ETFTQVNVVE
KPYVDPVSYV VLGIGVIAGI VGALAAARRK