Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_0206 |
Symbol | |
ID | 5054724 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | + |
Start bp | 185245 |
End bp | 186657 |
Gene Length | 1413 bp |
Protein Length | 470 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640467785 |
Product | 4-alpha-glucanotransferase |
Protein accession | YP_001152473 |
Protein GI | 145590471 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1640] 4-alpha-glucanotransferase |
TIGRFAM ID | [TIGR00217] 4-alpha-glucanotransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 0.539313 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTACGGG GGGCAGGCGT TTTACTCCAC ATAACTTCAC TCCCGGGCGG TTGCTACGTG GGTGATCTAG GCCCGGAGGC GTATAAATTC GCCGAGTTTT TAGCAGAGGC GGAGCAGACC TACTGGCAGA CGCTACCTAT TAACCACAGC GTGCCGGAGT ACGAGAACTC TCCCTACAGC GCCGTGTCGA GCTTCGCAGG GGATCCAAAA CTGATAAGCC TGGACCTCAT GAAGAGGGAG GGCCTCATAG ACCAAGTGCC GGATTGTCCC CCTGCCGAGA GGGTCGACTA CGCCGCGGCG TGGGAGGTGA AGAAGAAGGC GCTGGAAAAG GCGCTGAGGA GGGGCAAGAA GCTCAGCGAC TACAAGAACT TCGTGGAGTC CACCCCTTGG CTTGAAGACT ACGCCTACTA TATGGCCATG AGGGACCTCT ACGGGCCCTG GCCGAAGTGG CCGAGGAGAG ATCCGCCGGG GGAGCTGGTG GAGCTTTACA AATTCGCCCA GTTCGTCTTC TGGCGCCAGT GGCGGGAGCT CAAGCAGTAC GTAAACAGCT TGGGTATATT CTTAATAGGG GACCTCCCCA TATACCCCAG CCTAGACAGC GCCGACGTGT GGAGACACAG GCGGTACTTC AAAATCACAG AGGACGGCGC CCCCCTCTAC GTGGCCGGCG TCCCGCCGGA CTACTTCTCG CCGACTGGCC AGCTCTGGGG CAACCCAGTA TACAACTGGG AAGCCTTGAG GGCCGACGGT TACAGGTGGT GGCTAGACCG GCTGAGGCAC ACGCTGTCCG CATTTGACTA CGTGAGGCTG GACCACTTCC GCGGATACGT GGCCTACTGG GAGGTCCCTG CCGGCGAGAA GACGGCGGTG AACGGTCGGT GGGTCCCCGC GCCGGGGGCG GAGCTACTGG AAAAAGCCAG GTCGGAGCTG GGGGAGCTTA GGCTAATCGC AGAGGACCTC GGCTACATAA CGCCAGACGT GGTGGAGCTG AGAGACCGCC TCGGCTTCCC CGGCATGCGT GTCTTGCAGT TCGCCTGGGA CGGCAACCCC GCAAACGAGC ACAAGCCACA CAACCACGTC AAAAACTCCG TGGTGTACAC CGGCACCCAC GACAACAACA CGGCGGTGGG GTGGTATCTA GAAGAGGCGA CGCCGAGAGC GAGGCGGGAG TTTTGCCAGT ATGCGAAGTG CTCAGCCGCG GAGGGCGTAC ACTGGTGTTT CATCAGGCTG GCCTACATGT CAGTTGCCAA CGTAGCGATC GTGCCTATAC AAGACGTGCT GGGCCTTGGT AGCGAGGCGC GGATGAACAA GCCAGGCACA GTGGGGGGTA ACTGGAGGTG GAGGCTGGCA AAGATGCCCA ACGCCGCCGT GAGGAGGCGG CTGAGAAAAC TAACCCGCAT ATACGGGCGT TGA
|
Protein sequence | MLRGAGVLLH ITSLPGGCYV GDLGPEAYKF AEFLAEAEQT YWQTLPINHS VPEYENSPYS AVSSFAGDPK LISLDLMKRE GLIDQVPDCP PAERVDYAAA WEVKKKALEK ALRRGKKLSD YKNFVESTPW LEDYAYYMAM RDLYGPWPKW PRRDPPGELV ELYKFAQFVF WRQWRELKQY VNSLGIFLIG DLPIYPSLDS ADVWRHRRYF KITEDGAPLY VAGVPPDYFS PTGQLWGNPV YNWEALRADG YRWWLDRLRH TLSAFDYVRL DHFRGYVAYW EVPAGEKTAV NGRWVPAPGA ELLEKARSEL GELRLIAEDL GYITPDVVEL RDRLGFPGMR VLQFAWDGNP ANEHKPHNHV KNSVVYTGTH DNNTAVGWYL EEATPRARRE FCQYAKCSAA EGVHWCFIRL AYMSVANVAI VPIQDVLGLG SEARMNKPGT VGGNWRWRLA KMPNAAVRRR LRKLTRIYGR
|
| |