Gene BURPS1106A_A2691 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_A2691 
Symbol 
ID4905316 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009078 
Strand
Start bp2630486 
End bp2631652 
Gene Length1167 bp 
Protein Length388 aa 
Translation table11 
GC content66% 
IMG OID640145794 
Producthypothetical protein 
Protein accessionYP_001076721 
Protein GI126455958 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGTGA CACAATCCAT TAGTATTCCT ATCCATTACC CGGCCGCGAC GGCCGCATTG 
CTCTTGCTGC TGCTCACCGG TTGCGGCGGC GGCGGCGACC AGAGCAAGGT CAACGCCGCC
GCCTCGCCCG CGAACAACCT CGTCGTGCCG GCGCCCGGCA CGGCGTCGCC CGGCACGCCC
GCGCCCGCGC CCGGCGCGCC GGCGCCCGCC GAGACGGCTT CGGTGCTGCC GTTCTTCGGC
GTGAACGGCC ATTACGTCGA CGGCGGCGTC TACGCGTCGG TCCCGCTCGC CACGCAGGCA
AGCCACCTCG CCGGCCTCGG CATGAACGTC TACCGGCAGG ACGTGTACAT TCCGGATCAC
GTCGACACGC TCGCGTCGAC GGTCATTCCC GGCCTCGGTT CCGGCATCAC GGTCCTGCCG
ATGATCCAGG CGCATCCATG GGCCGATCCG TCGCTGAACG GCCAACCGCC GACCGAAGCC
AGCGCGTATG CGTACGCCTA CAAGCTGGCC GCCTACGCGG CGAAGAAGCT CGCCGGCATT
CCGATGGTGG AGTTCGGCAA CGAGTACGAC ATCGATAGCC ACAACGCGCC GATCCAGGGC
GACGGCATCA ATGTTTCGGA CTACGACAAT TCCACGTTCC CGATCTGGCG CGGCGCGCTC
CGAGGCTCGC TCGACGGCTG GCGCTCGGTC GACACGAACC GCACGACGAA GCTGATCGCG
AACGCAACGT CGGGGGCGCT GCATTTCGGC TTCCTCGACG GCCTGATGAC GGGCACGCAG
CCCGACGGCA CGACCGGGCA TCCGAAGATC ACGCCCGACG TGATCCAGTG GCACTGGTAT
TCGAACGGCG GCGATTTCGA GAACGCGCTC GGCAAGACCG GCCGATACAA CGTGCTTGCG
CGGCTGAAGG ACCGCTACAA CCTGCCGATC GTCGTCACCG AGATCGGCGT GAACACGGAC
AACTCCGACA CGCAGATCGC CGCGTACATC GCAAAGACGA TCCCCGAGCT GGTTGCGGCG
AAAGCCGCGT ACAACGTCAT CGGCTTCAAC TGGTATGAGC TTTACGACGA CCGCAGCGGC
GCTTACGGCT TGCTGACGAA CAGCGCACAG GAAAAGCCCC GTTACGGACT CATGCGCGCG
GCGATCGCCG GCGCCGTGCC GAACTGA
 
Protein sequence
MSVTQSISIP IHYPAATAAL LLLLLTGCGG GGDQSKVNAA ASPANNLVVP APGTASPGTP 
APAPGAPAPA ETASVLPFFG VNGHYVDGGV YASVPLATQA SHLAGLGMNV YRQDVYIPDH
VDTLASTVIP GLGSGITVLP MIQAHPWADP SLNGQPPTEA SAYAYAYKLA AYAAKKLAGI
PMVEFGNEYD IDSHNAPIQG DGINVSDYDN STFPIWRGAL RGSLDGWRSV DTNRTTKLIA
NATSGALHFG FLDGLMTGTQ PDGTTGHPKI TPDVIQWHWY SNGGDFENAL GKTGRYNVLA
RLKDRYNLPI VVTEIGVNTD NSDTQIAAYI AKTIPELVAA KAAYNVIGFN WYELYDDRSG
AYGLLTNSAQ EKPRYGLMRA AIAGAVPN