Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_1899 |
Symbol | |
ID | 5055396 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | - |
Start bp | 1706160 |
End bp | 1707332 |
Gene Length | 1173 bp |
Protein Length | 390 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640469448 |
Product | pullulanase |
Protein accession | YP_001154102 |
Protein GI | 145592100 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG4945] Membrane-anchored protein predicted to be involved in regulation of amylopullulanase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 37 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGTCAA AAGAAGTACT CCTTGCAGTT ATGTTAGCCG CTTTGATATA TGCACAGGGG GTATTTACTG TACAAACGGC CACTGACCCA ACAGGTGATT TCAAAGGACC TGGATGGTTT GTCCCGCCTC AGAATCCCGT TTTTAAAAAC GGGACTGTAT TTGATCTCAC AAAGTTTGAA GTCCTTTATA ATGCCACGGC AGACGCACTA GTCTTTAGAC TAACCTTCGC TGACCTCGGC GGCAACCCGT GGGGCTCCGA GACAGGCTTC TCGTTGCAGT ATGTGCAGAT ATACATAAGC AGAGGCTTCC CTGGCAACCC GTGGGGGACA GTATCGTGCA CGATCCTAAG ACCTGACGAC GGCGATGTGG CCTCGGGCAA CGCCTTTTTT GACGAGGCCA CGAGATTCTT CTGCCCCGAT CCCGCCAACT TGACGCAGTT TAAATACACG CCGGGGGTGA AGTTCTCAAG CCAAGCCCCG TGGGACGTCG CGATTTTCAT AGGCCCCAAG TGGGGCAACG AGACTGTTAA CTTCGTCGCA GTTGCAGATG TGACGGGTGG CACCATAAGC GTCTCGCCAC TCCCGCGCGT CTACGCACAG GGCAACGCCA TAGTGGCAGT TGTGCCCAGG AAGCTAATAC CGCCAACCAC GAGGCTAATG AGCGATTTCC CACAACCAAG CTGGAGGTAC TACGTGTTGG TCACCTCTTA CGACGGCAAC GGTCCCGGCC GCATTAGACC CTTCGGACCT ATGGCCCAGG AGTGGACAGT GGGCGTAGGT ACCGCTAACG CCTCTTCTGT TTTATCAGGA ACTATTCCTA GAGTGCTCGA TGTACTAGGT CCTAACACTC CGTTGAGAAC TTTCACTAAG GATGAGCCAG CAACGCTGGA GCCCCAGACG CCGAGCTGGG GCAACTTCCC GCTAGCCTAC ACAACCACCA CCGTTAATAA AATAGTGCCG CTCACAGTAA CCAAGACGGA CACATTAACA CTTACGGAAA CCGCATACAT AACGTTGACC ACCACAAGAG TGGAAACATT AACCAGAGTA GAGACTTTTA CCCAGGTCAA CGTGGTGGAG AAGCCTTACG TCGATCCGGT AAGTTACGTC GTGTTGGGTA TCGGCGTAAT CGCCGGTATT GTGGGGGCGC TTGCCGCGGC GAGGAGAAAA TAA
|
Protein sequence | MKSKEVLLAV MLAALIYAQG VFTVQTATDP TGDFKGPGWF VPPQNPVFKN GTVFDLTKFE VLYNATADAL VFRLTFADLG GNPWGSETGF SLQYVQIYIS RGFPGNPWGT VSCTILRPDD GDVASGNAFF DEATRFFCPD PANLTQFKYT PGVKFSSQAP WDVAIFIGPK WGNETVNFVA VADVTGGTIS VSPLPRVYAQ GNAIVAVVPR KLIPPTTRLM SDFPQPSWRY YVLVTSYDGN GPGRIRPFGP MAQEWTVGVG TANASSVLSG TIPRVLDVLG PNTPLRTFTK DEPATLEPQT PSWGNFPLAY TTTTVNKIVP LTVTKTDTLT LTETAYITLT TTRVETLTRV ETFTQVNVVE KPYVDPVSYV VLGIGVIAGI VGALAAARRK
|
| |