Gene Pars_0206 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_0206 
Symbol 
ID5054724 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp185245 
End bp186657 
Gene Length1413 bp 
Protein Length470 aa 
Translation table11 
GC content61% 
IMG OID640467785 
Product4-alpha-glucanotransferase 
Protein accessionYP_001152473 
Protein GI145590471 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1640] 4-alpha-glucanotransferase 
TIGRFAM ID[TIGR00217] 4-alpha-glucanotransferase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value0.539313 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTACGGG GGGCAGGCGT TTTACTCCAC ATAACTTCAC TCCCGGGCGG TTGCTACGTG 
GGTGATCTAG GCCCGGAGGC GTATAAATTC GCCGAGTTTT TAGCAGAGGC GGAGCAGACC
TACTGGCAGA CGCTACCTAT TAACCACAGC GTGCCGGAGT ACGAGAACTC TCCCTACAGC
GCCGTGTCGA GCTTCGCAGG GGATCCAAAA CTGATAAGCC TGGACCTCAT GAAGAGGGAG
GGCCTCATAG ACCAAGTGCC GGATTGTCCC CCTGCCGAGA GGGTCGACTA CGCCGCGGCG
TGGGAGGTGA AGAAGAAGGC GCTGGAAAAG GCGCTGAGGA GGGGCAAGAA GCTCAGCGAC
TACAAGAACT TCGTGGAGTC CACCCCTTGG CTTGAAGACT ACGCCTACTA TATGGCCATG
AGGGACCTCT ACGGGCCCTG GCCGAAGTGG CCGAGGAGAG ATCCGCCGGG GGAGCTGGTG
GAGCTTTACA AATTCGCCCA GTTCGTCTTC TGGCGCCAGT GGCGGGAGCT CAAGCAGTAC
GTAAACAGCT TGGGTATATT CTTAATAGGG GACCTCCCCA TATACCCCAG CCTAGACAGC
GCCGACGTGT GGAGACACAG GCGGTACTTC AAAATCACAG AGGACGGCGC CCCCCTCTAC
GTGGCCGGCG TCCCGCCGGA CTACTTCTCG CCGACTGGCC AGCTCTGGGG CAACCCAGTA
TACAACTGGG AAGCCTTGAG GGCCGACGGT TACAGGTGGT GGCTAGACCG GCTGAGGCAC
ACGCTGTCCG CATTTGACTA CGTGAGGCTG GACCACTTCC GCGGATACGT GGCCTACTGG
GAGGTCCCTG CCGGCGAGAA GACGGCGGTG AACGGTCGGT GGGTCCCCGC GCCGGGGGCG
GAGCTACTGG AAAAAGCCAG GTCGGAGCTG GGGGAGCTTA GGCTAATCGC AGAGGACCTC
GGCTACATAA CGCCAGACGT GGTGGAGCTG AGAGACCGCC TCGGCTTCCC CGGCATGCGT
GTCTTGCAGT TCGCCTGGGA CGGCAACCCC GCAAACGAGC ACAAGCCACA CAACCACGTC
AAAAACTCCG TGGTGTACAC CGGCACCCAC GACAACAACA CGGCGGTGGG GTGGTATCTA
GAAGAGGCGA CGCCGAGAGC GAGGCGGGAG TTTTGCCAGT ATGCGAAGTG CTCAGCCGCG
GAGGGCGTAC ACTGGTGTTT CATCAGGCTG GCCTACATGT CAGTTGCCAA CGTAGCGATC
GTGCCTATAC AAGACGTGCT GGGCCTTGGT AGCGAGGCGC GGATGAACAA GCCAGGCACA
GTGGGGGGTA ACTGGAGGTG GAGGCTGGCA AAGATGCCCA ACGCCGCCGT GAGGAGGCGG
CTGAGAAAAC TAACCCGCAT ATACGGGCGT TGA
 
Protein sequence
MLRGAGVLLH ITSLPGGCYV GDLGPEAYKF AEFLAEAEQT YWQTLPINHS VPEYENSPYS 
AVSSFAGDPK LISLDLMKRE GLIDQVPDCP PAERVDYAAA WEVKKKALEK ALRRGKKLSD
YKNFVESTPW LEDYAYYMAM RDLYGPWPKW PRRDPPGELV ELYKFAQFVF WRQWRELKQY
VNSLGIFLIG DLPIYPSLDS ADVWRHRRYF KITEDGAPLY VAGVPPDYFS PTGQLWGNPV
YNWEALRADG YRWWLDRLRH TLSAFDYVRL DHFRGYVAYW EVPAGEKTAV NGRWVPAPGA
ELLEKARSEL GELRLIAEDL GYITPDVVEL RDRLGFPGMR VLQFAWDGNP ANEHKPHNHV
KNSVVYTGTH DNNTAVGWYL EEATPRARRE FCQYAKCSAA EGVHWCFIRL AYMSVANVAI
VPIQDVLGLG SEARMNKPGT VGGNWRWRLA KMPNAAVRRR LRKLTRIYGR