Gene BURPS1106A_A2038 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_A2038 
Symbol 
ID4904441 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009078 
Strand
Start bp2010722 
End bp2012155 
Gene Length1434 bp 
Protein Length477 aa 
Translation table11 
GC content72% 
IMG OID640145143 
Producthypothetical protein 
Protein accessionYP_001076071 
Protein GI126456857 
COG category[S] Function unknown 
COG ID[COG3522] Uncharacterized protein conserved in bacteria 
TIGRFAM ID[TIGR03353] type VI secretion protein, VC_A0114 family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGCACG ACGCATCACG AACCGGAACG AACGACATGG ACAACGTCTA TTGGCATCAG 
GGGATGCTGC TGCAACCGCA ACATTTTCAG TTGGCCGAAC TGCACCAGCA GTTCCGCTTC
GAGCCGTGGC TCGCGTGCGG CCCGCCGCAT TTCTGGGGCG TCGGCGCGCT GTCGCTCGCG
CAGGCCGCGA TCGATCGCCG CGTGGTCGAG ATCCGCTCGG CGCGCCTGCT GTTCGCCGAT
CGCAGCTACG TCGAATATCC GGGCAACGCG GTCGTCGCCG CGCGCGCGTT CGATCCCGCG
TGGCTCGACG AAGGCCGCGC GCTCGTCGCG CACGTCGCGC TCAGGCGGCT CGCGCGCGGC
GCGAACAACG TGACGGTCGC GGCCGCGCCC GACGCGCTGC CCGACGCCCC GACGCGCTAC
GCGACGCTGC CGTCCGCCGA GGAGGTCGCC GATCTGCATT CGGACCATCC GGGCGCGCCG
GTGCGCACGC TCAAGCACGT GCTGAAGATC GTGTTCGAGC ACGAGCTCGA CGCGCTCGCC
GCGCACGAAA CGATCCCGAT CGCGCGGATC GTGCGCGACG GCGAGCGCCT GCGGCTCGAC
GACGATTTCG CGCCGCCCTG CTACGCGCTG TCGGGCTCGC GCACGCTGCT CGAGCGCGTG
CGCTGCATTC GCGACGAGCT CGCGGGCCGC GCGCGGCAGT TGCAGCAGTA CAAGAATCCG
CGCGAGATGC AACGCGCCGA ATTCGACGCG AGCTATGCGG CGTTCCTGCT CGCGCTGCGT
TCGCTGAACC GCTTCGGCCC GCTGCTGTTC CATCTCGCCG AATGCGACGG GCAGCATCCA
TGGACGGTCT ACGGCGTGCT GCGCCAACTC GTCGGCGAGC TGAGCGTGTT CTCCGAGCGC
TTCGACATGC TCGGCGAGAC GCCCGATGCG CGCGGCGGCC TGCCGCCGTA CGACCACCGC
GATCTGGGCG GCTGCTTCTC GCGCGCGCAC GCGCTGATCG GCCACCTGCT CGACGAAATC
GCGGTGGGCC CGGACTGCGT CGCGACGTTC GAGCCCGACG GCCCGCAGCA GCCCGCGCAA
CGCTCGGCGC AACTGCCGCC CGACGTGTTC GCGGATCGCC ACCAGATCTA TCTCGCGATC
CGCAGCGCGC ACGATCCGGA CACGCTCGCG CAACGCTTCG CGCTCGGCGG CCGGATCGCG
GCGACCGACG AAATGCCGCA GCTCACCGCG CTCGCGCTGC CGGGCGTCGA ACTCACCCGC
CTGCCCGGCC CGCCGCCGCG GCTGCCGCGC CGCGGCGACG CGCGCTACTT CCGGATCGAG
CAGGCCGGCC GCCCGTGGGA CGCGATCCGG CGTGACGGCC GCGTGTCGCT GCGCTGGGCC
GACGCGCCGG ACGACCTGCA CGCGGAACTC GTCGCGGTGA GGCACACGCA ATGA
 
Protein sequence
MTHDASRTGT NDMDNVYWHQ GMLLQPQHFQ LAELHQQFRF EPWLACGPPH FWGVGALSLA 
QAAIDRRVVE IRSARLLFAD RSYVEYPGNA VVAARAFDPA WLDEGRALVA HVALRRLARG
ANNVTVAAAP DALPDAPTRY ATLPSAEEVA DLHSDHPGAP VRTLKHVLKI VFEHELDALA
AHETIPIARI VRDGERLRLD DDFAPPCYAL SGSRTLLERV RCIRDELAGR ARQLQQYKNP
REMQRAEFDA SYAAFLLALR SLNRFGPLLF HLAECDGQHP WTVYGVLRQL VGELSVFSER
FDMLGETPDA RGGLPPYDHR DLGGCFSRAH ALIGHLLDEI AVGPDCVATF EPDGPQQPAQ
RSAQLPPDVF ADRHQIYLAI RSAHDPDTLA QRFALGGRIA ATDEMPQLTA LALPGVELTR
LPGPPPRLPR RGDARYFRIE QAGRPWDAIR RDGRVSLRWA DAPDDLHAEL VAVRHTQ