Gene BURPS1710b_A0635 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1710b_A0635 
SymbolhipA 
ID3693459 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1710b 
KingdomBacteria 
Replicon accessionNC_007435 
Strand
Start bp834824 
End bp836692 
Gene Length1869 bp 
Protein Length622 aa 
Translation table11 
GC content71% 
IMG OID637730888 
ProducthipA protein 
Protein accessionYP_335793 
Protein GI76819740 
COG category[R] General function prediction only 
COG ID[COG3550] Uncharacterized protein related to capsule biosynthesis enzymes 
TIGRFAM ID[TIGR03071] HipA N-terminal domain 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.427035 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCCCGCT CATCCGAAAT ACAGATTCAA CCTTGTAAAT TCTATTTACA GTGTATAAAC 
TGTTTTTCTC GCATACAGGC GAAAATTTGT ATTCCAACCA GGTGCGCACA TGGCCATCCT
CATCGAGCAC GAGATCAAGA CACTCGGCCA GCTGCGGCCG ATTCTGCGCG GCTTCCGCAA
ATCGGCCGGA TTGACGCAGG CGATACTCGC GAGCCGCCTC GGCGTCACGC AGCAGACCTA
CGCGCAGTTC GAGGCGAACC CGGCGTCGGC GAGCGTCGAG CGGCTGTTCA AGGTGCTGCG
CGCGCTCGAC ATCGAACTCA CGCTCACGCT CACGCAGGTC TACGCCGCGC CCGCGGGCAA
GGACAAGGGC GAGGTTGCGA AGACGGCCGC AGGCGCGCGC GCGCGCGCGG GCGCGCGACG
TGCCGTGCCG CCCGCGTCCG CGCCCGCTCC GAGCGCCGCC GGGCGCGCGC CCCGCCCCGC
CCGCAAGCGC GCCGCCCCGA AAAAGCGGGA GGACTGGTGA GCGCCCGCCG CGCACGCGCG
ACGCGCCTGC ACCTGTGGAT GAACGGCCTG CCCGTCGGCT ACTGGGAGCA CGCGCGCGAC
GGCGAGCGCC TTGTCTACTT CGACGAATGG ATCGGCGATC CGCAAGGCCG GCCGCTGTCG
CTGTCGCTGC CGTTCACGCC GGGCAACCAG CCGTATCGCG GTCGGCTCGT CAGCGATTAT
TTCGACAACC TGCTGCCCGA CAGCGAGCCG ATCCGCCGGC GAATCGCGAT GCGCTACCGC
ACGGGCGGCA CGTCCGCGTT CGCGCTGCTC GCGACGCTCG GCCGCGATTG CGTCGGCGCG
CTGCAGATGC TGCCGCCCGA CGAAGCGCCG GACGACATCG AACGCATCCG CGGCCACGCG
CTCGCCGACG CGGACATCGC GCGCCTGCTG CGCGAAGTCA CGTCCGCGCC GCAGGCCGGC
CGGCACGCGC CGCTCGACGA TCTGCGCCTG TCGATCGCCG GCGCGCAGGA GAAGACCGCG
CTGCTGCGCC ATCGCGGCCG CTGGCTGCTG CCCGAAGGGA GCACGCCGAC CACGCACATC
CTGAAGCTGC CGCTCGGGCT CGTCGGCAAC CGGCGCGCCG ACATGCGCAC GTCGGTCGAG
AACGAATGGC TGTGCGCGCG GATCGTCGCC GCGTACGGGT TGCCCGTCGC GCGCTGCGAC
ATCGCTCAGT TCGACGATCA GAAAGCGCTC GTCGTCGAGC GCTTCGACCG CCGGCCGTCG
CGCGACGCAC GCTGGCTCCT GCGGCTGCCG CAGGAAGACA TGTGCCAGGC AACCGGCACG
TCCGCGCTCG ACAAATATCA GGCCGACGGC GGCCCCGGCA TCGAGACGAT CATGGAAGTG
CTCGCCGGCT CCGAGCACGC GCGGGACGAC CGCCGCGCGT TCTTCGCGGC GCAGATCGTG
TTCTGGCTGC TCGCCGCGAC CGACGGCCAC GCGAAGAACT TCAGCATCGC GCACCTGCCC
GGCAACCGCT ACCGTTCGAC GCCGCTTTAC GACGTGCTGT CCGCGCATCC GGTCATCGGC
CGGGGCGCGA ACCAGTTGCC CGCGCAGCGC GCGCGGCTCG CGATGGGCGT GCGCGGCAAG
CACATCCACT ATCCGCTGCA CCAGATCCGG CGGCGGCACT GGATCGCGCA GGGCCAGCGC
GTCGGCTTCG CGCCCGCCGA CGTCGACGCG CTGATCGACA CGCTGACCGC GCGCACCGCG
GGCGTCGTCG ACGCGGTGTC GGCGCGGCTG CCGCGCGATT TTCCGCGCGA CGTCGCCGAT
GCGATCTTCA GCGGAATGCT CGGCCTGAGC GCAAGGCTCG CCGGCGACGC GGCCGCGCGC
GCACCATGA
 
Protein sequence
MPRSSEIQIQ PCKFYLQCIN CFSRIQAKIC IPTRCAHGHP HRARDQDTRP AAADSARLPQ 
IGRIDAGDTR EPPRRHAADL RAVRGEPGVG ERRAAVQGAA RARHRTHAHA HAGLRRARGQ
GQGRGCEDGR RRARARGRAT CRAARVRARS ERRRARAPPR PQARRPEKAG GLVSARRARA
TRLHLWMNGL PVGYWEHARD GERLVYFDEW IGDPQGRPLS LSLPFTPGNQ PYRGRLVSDY
FDNLLPDSEP IRRRIAMRYR TGGTSAFALL ATLGRDCVGA LQMLPPDEAP DDIERIRGHA
LADADIARLL REVTSAPQAG RHAPLDDLRL SIAGAQEKTA LLRHRGRWLL PEGSTPTTHI
LKLPLGLVGN RRADMRTSVE NEWLCARIVA AYGLPVARCD IAQFDDQKAL VVERFDRRPS
RDARWLLRLP QEDMCQATGT SALDKYQADG GPGIETIMEV LAGSEHARDD RRAFFAAQIV
FWLLAATDGH AKNFSIAHLP GNRYRSTPLY DVLSAHPVIG RGANQLPAQR ARLAMGVRGK
HIHYPLHQIR RRHWIAQGQR VGFAPADVDA LIDTLTARTA GVVDAVSARL PRDFPRDVAD
AIFSGMLGLS ARLAGDAAAR AP