Gene Pars_1778 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_1778 
Symbol 
ID5055532 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp1598771 
End bp1600579 
Gene Length1809 bp 
Protein Length602 aa 
Translation table11 
GC content54% 
IMG OID640469323 
Productglucosamine--fructose-6-phosphate aminotransferase 
Protein accessionYP_001153981 
Protein GI145591979 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0449] Glucosamine 6-phosphate synthetase, contains amidotransferase and phosphosugar isomerase domains 
TIGRFAM ID[TIGR01135] glucosamine--fructose-6-phosphate aminotransferase (isomerizing) 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.285931 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.0291099 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTGCGGTA TTTTTGGCAT AGTCTTCGCC GAGCGGCCAA GACGGCCGCT CGGCGAGATT 
CTGCGTAGGA GTTTAGAGAG GCTTGAATAT CGAGGCTACG ACTCAGCCGG GGTTGCTGTA
GTGGACAGGG GGTTGGTTGT GAGAAAAGAC GCGGGAAAAG TTGCGGAGGT GGCGTCACGC
CACGGTTTTG ACGCGTTGCA AGGCGTCGCG GGACTTGCCC ATACCAGATG GGCCACGCAC
GGCGCTCCTA ACCAAATCAA CGCCCATCCT CACACTGACT GTAGGGGAGT GTTGGCAGTA
GTCCACAATG GGATTATAGA AAACTACGCC GAGCTTAGAG AAGAACTCGC AAAGAGGGGT
CACATATTTC GCACCGAGAC CGACACAGAG GTTTTTGTGC ACTTGGTGGA AGAATACAAG
AGACAGGGGC TCGACACTTT CTCCGCGTTT AAAAAAGCGC TGGCAAGAAT AAAAGGCGCC
TACGCCATCG CCCTAATAGA CGTCGAGAAT CCGCAAGTTA TATACTTCGC CAGGAATCTC
TCGCCTCTTA TAATCGGCGT GGGGGATGGT TTTAACATCG TCGCCAGCGA CATACCGACT
GTTCTCGACC ACACCAGGAG AGTAATAGCC ATAAAAGATG GCGAGTACGG CTATATAACT
CCCACAGAGG TATACATAGA ATCAGACGGC GTGCCGCAAG ATGCCGCCTC TAGAATTGAG
GAGATCCCTT GGAGCGCCGA GATGGCTACA AAGGGAGGCT ACCCCCACTT TATGCTAAAG
GAGATATATG AGCAGCCTGA GTCGCTTGCC TTCACTGTCG CCGGGCTTGA GCCCGCCCAG
CTTGAGGCTG TGGCCAACGC CGTCCTGTCC GCAAGGAATG TGTACTTGGT TGGCGCCGGG
ACGTCCTACC ACGCCGGGCT CACATTGGCA TATCTGCTCC CCAGGTTGAG AATCACAGCT
ATTCCCATTA TATCGTCTGA GTACGCCATA TACGAGAACC TCTTTGACAG AGACGACGTG
GCCATCGTCG TATCGCAGTC TGGAGAGACC ATAGATACCA TAAAAGCTGC AAGAGCCATG
AGAGAAAAAG GCGTGAGGGT AGTGGCGGTG ACCAACGTGG TTGGAAGTAC TCTCTCCCGA
GAAAGCGACG CTACAATATA CACAAGAGCC GGGCCAGAAA TCGGAGTAGC CGCTACAAAG
ACTTTTACTA CCCAAGTCCT CACTCTAGGC GCGCTGTATG TCACGGCGCT GAGTCTGTTG
GGATACGACG TTTCTCAACA CGTCGACGAG ATGAAAAAGG TGCCGGATCT GGCGAGGAAG
ACTATAGAAA ACACCGCAGG CACGGCAAAA GATCTCGCCA GGAGGCTAAG AAATAGGCCA
AGCGCCTACT ACCTAGGAAG GGGGGCTGCC TTGCCTGTCG CCATGGAGGG GGCACTAAAG
TTGAAAGAGG TGGCATATAT ACACGCTGAG GCCTACTCGG CAGGCGAGTC CAAACATGGT
CCAATAGCAC TTGTGGAGCC AGGCTTCCCC ACACTATTCG TATTCTCAGA CCCCCAGACT
AGGGAGAAGA CGTTGAGCAA CGTCGCAGAG ATGAAGGCAA GAGGGGCGTT AACCATTGGC
ACAGTTCCGG CAAAAAGCGA CTACGCCAAG AAGCTGGACG TGGCGATAGA GGTTCCAGAG
ATGGGCGACG TCTTCGCCCC AATTATCCAC GTCATCCCAT TGCAGATGCT CGCCTACTTC
GCCGCCGTTG AGAGGGGGTA CGACCCCGAC AAACCGAGAA ATCTAGCGAA AACCGTGACA
GTGGAATAA
 
Protein sequence
MCGIFGIVFA ERPRRPLGEI LRRSLERLEY RGYDSAGVAV VDRGLVVRKD AGKVAEVASR 
HGFDALQGVA GLAHTRWATH GAPNQINAHP HTDCRGVLAV VHNGIIENYA ELREELAKRG
HIFRTETDTE VFVHLVEEYK RQGLDTFSAF KKALARIKGA YAIALIDVEN PQVIYFARNL
SPLIIGVGDG FNIVASDIPT VLDHTRRVIA IKDGEYGYIT PTEVYIESDG VPQDAASRIE
EIPWSAEMAT KGGYPHFMLK EIYEQPESLA FTVAGLEPAQ LEAVANAVLS ARNVYLVGAG
TSYHAGLTLA YLLPRLRITA IPIISSEYAI YENLFDRDDV AIVVSQSGET IDTIKAARAM
REKGVRVVAV TNVVGSTLSR ESDATIYTRA GPEIGVAATK TFTTQVLTLG ALYVTALSLL
GYDVSQHVDE MKKVPDLARK TIENTAGTAK DLARRLRNRP SAYYLGRGAA LPVAMEGALK
LKEVAYIHAE AYSAGESKHG PIALVEPGFP TLFVFSDPQT REKTLSNVAE MKARGALTIG
TVPAKSDYAK KLDVAIEVPE MGDVFAPIIH VIPLQMLAYF AAVERGYDPD KPRNLAKTVT
VE