Gene BURPS668_A1784 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_A1784 
Symbol 
ID4887731 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009075 
Strand
Start bp1728788 
End bp1730851 
Gene Length2064 bp 
Protein Length687 aa 
Translation table11 
GC content69% 
IMG OID640131722 
Producthypothetical protein 
Protein accessionYP_001062779 
Protein GI284159992 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGAAACG GCGAGCGCCC GCCTGCGCGC CGGCCCGATT CATCCGGCTC GCCGCCGCCC 
GCGGCGGATG CGCCGGCCGC GTCCAACCAT CCTTTTTCGA GCCACGATAC CAAGCACATG
ACATCCCGCC GCCTCGCCTC AAGAACCGCC GTCGCCGCCT CGCTCTCCGC CCTGATGCTC
GCCGCATGCG GGGGCGACGA TTCCGCCAAT GCACCGACCG CGGGCGGCGC CGCGCCGTTG
ACGCCCGCGG TGGCGAGCCC CGCCGGGCCG ACGGGCAGCA CGCCGGGCAG CACGCCGGGC
GCCACGACCG CGCCCGCGCC GAGCAGCACG TCCGCCGGCC AGCTCTCGGT CGACAAGATG
GCATTCGCGC AGACGCACGT GGTGCCGAGC GGCGGCCTGA GCTGGACGCT GCCGAACGCG
AGCGCGAGCC TTCGCCCGAT CAGCAGGCGC GACGCGCTCG TGCTGGTCGC GATCGGCCAG
GCCGACGCGG TGCAGCCCGT TCTCGAAGCG TGGAAGGACG GCGCGAAGCT CGGCGCGCTC
GCACTGAGCC CGCCGTCCGC GCTGCCGCCG ACCGAATCCG GCGGCCGCGC GTATGCGAAC
GATCGATGGA GCGCCGTCGT GCCCGCCGCG TGGATGGTGC CGGGCGTGTC GTTCAGCGTG
TCGGCGTCCA ATTACACGTC GAGCGTCGCG CAAGCGCCCG TGTTCGGCAC CGACGCCGAC
GTGCAACTGA CGATCCTGCC GTTCTACCTG TTCGGCGCCG ATGACACCAA TTCGCCGCCA
CTGTCGACCA CGCAAGCGCC CGACGCCGCC ACGCAGCAGG AAATTTTCGC GAAATGGCCG
ACAGCGGAGC TGAAGGTCCG CACGCATCCG GCCGGGCGCT TCAGCCTCGC GACGGTCGTG
GTCGGCCCCC GCGCCGATCG CACGGGCGCC GCGCAGCCCG CCTATCCGGT GACCGCGCTC
GACCAGCAGA AAGACGGCTA CGGCGTGATG AGCGCGATGC TCACGCTGAT CACGAACATG
CGCACGGCGA ACGGCGACGG CCCGCTCAAC GATCAGTACT ACGCCCCCCT CATCGCGCTG
AACTCGAACG GACAGTTCGC GAACCTCGGC GGCGGTCTGG GCGGCGTCGG CTCGGGCGCG
GCGGTCGGCG ATCACCGTTA TACCGGCATC TTCATTCACG AGCAGGGGCA CGCGTTCGGC
CTCAATCACG CGGGCGACGA GTACGCGAAA GGCGCCTATC CCTATGCGGG CGGCAGCCTG
AGCGGATCGG TCTGGGGCTA CGACCCGAAT CACCGCGAGT TCCTCGACGT GCTCGTGCCC
ACCACCGCAT CGAGCTACGC GAAATGCGCG AGCTCGCATC AGCTCGACGC GCAGGGCCGC
TGCTACAAGC AGGATCCGAT GCAGGGCGGC GCCGGCGATC AGTCGAGCGG ATACAAGTTC
GCGACGTTCT CGGACTACAA CACGGGCCGG ATGCAGGCAT GGATCGCATC GCGCGTGCTG
GCCGATCCGG CGTCGTCGAC GGGCTACAGC AAGTGGGACA GCGCCGCGCA GGCGCGCGCG
CCGTACACGC CGACGACCGA CAACAACGGG CTCTACGGCG TCAACCAGAA TCTGCCCGTT
CAGGCCGGCG TGCCGGTTCA CACGATCGTC GTGAGCTTCA GCAAGGCCGG CTCCGCGGGC
GCGTCGTACA TCTATCCGCC GTTCTCCTAC ACCGGCAACC TGATCGCGAC GTTCGATCCG
ACCTCCGCCG CCGACCGCCA GGCGATCACC GTCGATAAGG GCACGTACCC GTGGTATTGC
AAGGGCACCG GGTGCGACTA CACGCTGCGC GTGACCTATG CGGACGGCAG CCAGACGTAT
CGCGTGCTGC AAGGCGGATT CCGTGCGTGG TGGACGCCCA CCGTCGACGA CGCGAACGCG
ACCAATCCGC TCAGCGGCAG CAGCTTCCGC GTATGGGCAA TCAACGTGCC GGGCGACAAG
CGGATCGGCA AGATCGAGCT GCTCGACACG CCGATGGTCT GGAACGGCAT GCCGGCGAAT
CCGACCGTGC TGCTCAGCCG GTGA
 
Protein sequence
MGNGERPPAR RPDSSGSPPP AADAPAASNH PFSSHDTKHM TSRRLASRTA VAASLSALML 
AACGGDDSAN APTAGGAAPL TPAVASPAGP TGSTPGSTPG ATTAPAPSST SAGQLSVDKM
AFAQTHVVPS GGLSWTLPNA SASLRPISRR DALVLVAIGQ ADAVQPVLEA WKDGAKLGAL
ALSPPSALPP TESGGRAYAN DRWSAVVPAA WMVPGVSFSV SASNYTSSVA QAPVFGTDAD
VQLTILPFYL FGADDTNSPP LSTTQAPDAA TQQEIFAKWP TAELKVRTHP AGRFSLATVV
VGPRADRTGA AQPAYPVTAL DQQKDGYGVM SAMLTLITNM RTANGDGPLN DQYYAPLIAL
NSNGQFANLG GGLGGVGSGA AVGDHRYTGI FIHEQGHAFG LNHAGDEYAK GAYPYAGGSL
SGSVWGYDPN HREFLDVLVP TTASSYAKCA SSHQLDAQGR CYKQDPMQGG AGDQSSGYKF
ATFSDYNTGR MQAWIASRVL ADPASSTGYS KWDSAAQARA PYTPTTDNNG LYGVNQNLPV
QAGVPVHTIV VSFSKAGSAG ASYIYPPFSY TGNLIATFDP TSAADRQAIT VDKGTYPWYC
KGTGCDYTLR VTYADGSQTY RVLQGGFRAW WTPTVDDANA TNPLSGSSFR VWAINVPGDK
RIGKIELLDT PMVWNGMPAN PTVLLSR