Gene BURPS1710b_3336 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1710b_3336 
Symbol 
ID3689998 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1710b 
KingdomBacteria 
Replicon accessionNC_007434 
Strand
Start bp3656548 
End bp3657810 
Gene Length1263 bp 
Protein Length420 aa 
Translation table11 
GC content67% 
IMG OID637729791 
Productphospho-2-dehydro-3-deoxyheptonate aldolase 
Protein accessionYP_334707 
Protein GI76808557 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase 
TIGRFAM ID[TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGCGCG AAGCGCAACC GAATTCCCGA AAAACCGCCG GCGAACCCGG CGGTTTTTTT 
TCGCCCCGCC GGTTCGCAAG CAGGGACGAC GGGCCGATCC ACCGCTTTAC CGAATCACCG
ATTTGTCGAA TCGAACCGCC AGCCGCACCC GCGCACGCCG GGCGGCGCAC CGAACCGGAG
AACTCAAGCA TGCCCCCGCA CAATACCGAC GACGTCCGCA TCCGTGAACT GAAGGAGCTG
ACTCCGCCCG CCCACCTGAT CCGCGAATTC GCGCTCGGCG AGGCGGTGTC GGAGCTCATC
TACAACGCGC GCCAGGCGAT GCACCGGATC CTGCACGGGA TGGACGATCG CCTGATCGTC
ATCATCGGGC CGTGCTCGAT CCACGACACG AAGGCGGCGC TCGAATACGC GGGCCGGCTC
GTCCAGGAGC GCGAGCGCTT CGCAAGCGAG CTCGAGATCG TGATGCGCGT GTACTTCGAG
AAGCCGCGCA CGACGGTCGG CTGGAAGGGG CTCATCAACG ATCCGCACCT GGATAACAGC
TTCAAGATCA ACGACGGCCT GCGCACCGCG CGCGAGCTGC TGCTGCAGAT CAACGAGATG
GGGCTGCCCG CCGGCACCGA ATACCTCGAC ATGATCAGCC CGCAGTACAT CGCGGACCTG
ATCTCGTGGG GCGCGATCGG CGCGCGCACG ACCGAATCGC AGGTGCACCG CGAGCTCGCG
TCGGGGCTGT CGTGCCCGGT CGGCTTCAAG AACGGCACCG ACGGCAACGT GAAGATCGCG
GTCGACGCGA TCAAGGCCGC ATCGCAGCCG CACCATTTCC TGTCGGTGAC GAAGGGCGGC
CATTCGGCGA TCGTGTCGAC GGCCGGCAAC GAGGACTGCC ACGTGATCCT GCGCGGCGGC
AAGGCGCCGA ACTACGATGC CGACAGCGTG AACGCCGCGT GCGCGGACAT CGGCAAGGCC
GGCCTCGCCG CGCGCCTGAT GATCGACGCG AGCCATGCGA ACAGCTCGAA GAAGCACGAG
AACCAGATTC CGGTATGCGC GGACATCGGC CGCCAGATCG CCGCGGGCGA CGAGCGCATC
GTCGGCGTGA TGGTCGAGTC GCACCTCGTC GAAGGCCGCC AGGACCTGAA GGAAGGCTGC
CCGCTCACGT ACGGCCAGAG CATCACCGAT GCATGCATCA ACTGGGACGA CAGCGTGAAG
GTGCTCGAAG GGCTCGCCGA AGCGGTGAAG GCGCGGCGCG TCGCGCGCGG CAGCGGCAAC
TGA
 
Protein sequence
MAREAQPNSR KTAGEPGGFF SPRRFASRDD GPIHRFTESP ICRIEPPAAP AHAGRRTEPE 
NSSMPPHNTD DVRIRELKEL TPPAHLIREF ALGEAVSELI YNARQAMHRI LHGMDDRLIV
IIGPCSIHDT KAALEYAGRL VQERERFASE LEIVMRVYFE KPRTTVGWKG LINDPHLDNS
FKINDGLRTA RELLLQINEM GLPAGTEYLD MISPQYIADL ISWGAIGART TESQVHRELA
SGLSCPVGFK NGTDGNVKIA VDAIKAASQP HHFLSVTKGG HSAIVSTAGN EDCHVILRGG
KAPNYDADSV NAACADIGKA GLAARLMIDA SHANSSKKHE NQIPVCADIG RQIAAGDERI
VGVMVESHLV EGRQDLKEGC PLTYGQSITD ACINWDDSVK VLEGLAEAVK ARRVARGSGN