Gene BURPS668_A2972 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_A2972 
Symbol 
ID4887561 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009075 
Strand
Start bp2828431 
End bp2830311 
Gene Length1881 bp 
Protein Length626 aa 
Translation table11 
GC content72% 
IMG OID640132908 
Producthypothetical protein 
Protein accessionYP_001063963 
Protein GI126444448 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.463133 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGTCTTGCG CCGCGTGCGC GAGCCTTTTC ATCGCCGCCG CGGGCGCGTC GCCGCTCGAT 
GCCGATTCCG CCGACGCCGA CGCCGCGCCC GCGGCGGTCG TGAGCGACGT CCACGTGTTC
GTCGTCCAGC GCGACGGCTC CGTCGACGAG CATGACGATT CGACGTTGCG CGCGAACACC
GCGAGCGGCA TCGATGACGT CGCGCAGCGC TACGTATGGT TCAACAAGGA CATCGATCGC
GTCGAGCTGC TGCGCGCCGA GACGATCGAT CGCGCGGGCG TCGCGCATCC GGTGGGCCCC
GAGGCGATTC GCGACGTGCA GGAGCCGCGC TCGGCCGGCG CGCCGTTCTT CGAGGACGGC
GTGCTGCGCT CGGTGATCTT TCCGGGCGTC GACGCCGGCG CGCGCACGCG CCTCGTGTTC
CGCAAATCGC GCACGAAGCC GCGCGATCCC GGCTATTTCG GCTACTTCGC CGCGCCGTCG
CGCATGCCCG TCGAAGCGCA GCGCCTGATC TTCGATCTGC CCGCCGACAT GCCGCTCTAC
GCGGACGCGC GCGGCTACGT CGCGCGCGCG CCCGTGACGG AGAACGGCCG CACGCGCTAC
GCATTCGATT ATCGGCACGG CCCGTATCCG CGCATCGAGG AGGGCTCGGT CGGCTACACG
ACGTACGGCG ACCGGCTGAT GGTATCGACG CTGCGCGATT TCGCGGCGTT CGCCGGGCGG
TATCGCGCGG CGGCCGCCGA CGCGAGCGCG GCCGATCCCG CCGTCGCGCG GCTCGCGCAA
GCGATCGTCG CGAATGCGGA CGCGGCCGGG CCGCGTGCGA AGGCGCGGGC GATCTACGAC
TGGGTGCGCA CCCACGTGCG TTACGTCGCG CTCTTTCTCG GCGAGACCGC GGCCGCGCCG
CACAAGGTGA CGGATATCCT GCGCCACCGC TACGGCGACT GCAAGGATCA CGTCGCGCTG
TTCGGCGCGC TGCTCGCGGC GGCCGGCATA CGCAGCGAGC CGGCGCTGCT GAATCTCGGC
GCCGTCTATA CGCTGCCGGA CGTGCCGGGC TACGGCGGCG GCGCGATCAG CCATGCGATC
ACGTGGCTGC CGGACCTCGG GCTGTTCGCC GATACGACGG TGGGCGGCGT CGAGTTCGGC
TATCTGCCTC CCGTCGTGAT GGATCGGCCC GTGCTCCTCG TCGACGACGG CGTGCTGTCG
CGCACGCCCG CCGCGCAGCC GCGCACGCGC GACGCGCGCC TGCGGATCGA TGTCGCGCCG
GATGGCGACG CGCGCTATCA GTATCACGTC GAGGACGGCG GCTGGCCCGC GGAGTTCGAG
CGCGGCGCGT TCCGGCAGGC CGCGCGCGAG CGCGTGCGGC AACTCGCGGC CGATCGGCTG
CGCCAGAGCG GCTTGCGCGG CACCGCGCGG CTCGGCACGA GCGAGCGCGA CGTGACGGGC
GGGCCGTTCT CGACTTCGAT GACGGGTGCG CTCGAACGCT TCGCCTGGCC CGACGGCACG
ACCGCGCTGC CGGCGCTGTC GAGCCTCGCG GGCGGCATCG CGACGCAGGT GCAGGGCTGG
CTCGCGGAGC CCGTGCGCAC GCAGCCGTGG CTGTGCATCG GCGCAGACTT CGACGAGACC
GCGCAGATCG CGCTGCCCGA GAACCTGCGC GTGACCGATC TGCCTGCCGA CGCGGGCGTG
CACGATCGCT TCTTCGATTA CGAATCGCAT TACGTGTTCG ACGCGCCGGC GCGTGTCGTG
CAGATCACGC GGCGCTTGCG CGCGCGCTTC GCGCACCAAG TGTGCGCGCC GGAGGATTTC
GCGGCGGCGC GCGCATCGCT CGAACACATC GAGCGCGACG TGCTCGCCCA GATCGTCGTG
CGGGCGAAGC CGCGCGACTG A
 
Protein sequence
MSCAACASLF IAAAGASPLD ADSADADAAP AAVVSDVHVF VVQRDGSVDE HDDSTLRANT 
ASGIDDVAQR YVWFNKDIDR VELLRAETID RAGVAHPVGP EAIRDVQEPR SAGAPFFEDG
VLRSVIFPGV DAGARTRLVF RKSRTKPRDP GYFGYFAAPS RMPVEAQRLI FDLPADMPLY
ADARGYVARA PVTENGRTRY AFDYRHGPYP RIEEGSVGYT TYGDRLMVST LRDFAAFAGR
YRAAAADASA ADPAVARLAQ AIVANADAAG PRAKARAIYD WVRTHVRYVA LFLGETAAAP
HKVTDILRHR YGDCKDHVAL FGALLAAAGI RSEPALLNLG AVYTLPDVPG YGGGAISHAI
TWLPDLGLFA DTTVGGVEFG YLPPVVMDRP VLLVDDGVLS RTPAAQPRTR DARLRIDVAP
DGDARYQYHV EDGGWPAEFE RGAFRQAARE RVRQLAADRL RQSGLRGTAR LGTSERDVTG
GPFSTSMTGA LERFAWPDGT TALPALSSLA GGIATQVQGW LAEPVRTQPW LCIGADFDET
AQIALPENLR VTDLPADAGV HDRFFDYESH YVFDAPARVV QITRRLRARF AHQVCAPEDF
AAARASLEHI ERDVLAQIVV RAKPRD