Gene BMASAVP1_A0486 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBMASAVP1_A0486 
SymbolaroG 
ID4680423 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia mallei SAVP1 
KingdomBacteria 
Replicon accessionNC_008785 
Strand
Start bp481094 
End bp482356 
Gene Length1263 bp 
Protein Length420 aa 
Translation table11 
GC content66% 
IMG OID639844763 
Productphospho-2-dehydro-3-deoxyheptonate aldolase 
Protein accessionYP_991835 
Protein GI121599772 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase 
TIGRFAM ID[TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGCGCG AAGCGCAACC GAATTCCCGA AAAACCGCCG GCGAACCCGG CGGTTTTTTT 
TCGCACCGCC GGTTCGCAAG CAGGGACGAC GGGCCGATCC ACCGCTTTAC CGAATCACCG
ATTTGTCGAA TCGAACCGCC AGCCGCACCC GCGCACGCCG GGCGGCGCAC CGAACCGGAG
AACTCAAGCA TGCCCCCGCA CAATACCGAC GACGTCCGCA TCCGTGAACT GAAGGAGCTG
ACTCCGCCCG CCCACCTGAT CCGCGAATTC GCGCTCGGCG AGGCGGTGTC GGAGCTCATC
TACAACGCGC GCCAGGCGAT GCACCGGATC CTGCACGGGA TGGACGATCG CCTGATCGTC
ATCATCGGGC CGTGCTCGAT CCACGACACG AAGGCGGCGC TCGAATACGC GGGCCGGCTC
GTCCAGGAGC GCGAGCGCTT CGCAAGCGAA CTCGAGATCG TAATGCGCGT GTACTTCGAG
AAGCCGCGCA CGACGGTCGG CTGGAAGGGG CTCATCAACG ATCCGCACCT GGATAACAGC
TTCAAGATCA ACGACGGCCT GCGCACCGCG CGCGAGCTGC TGCTGCAGAT CAACGAGATG
GGGCTGCCCG CCGGCACCGA ATACCTCGAC ATGATCAGCC CGCAATACAT CGCGGACCTG
ATCTCGTGGG GCGCGATCGG CGCGCGCACG ACCGAATCGC AGGTGCACCG CGAGCTCGCG
TCGGGGCTGT CGTGCCCGGT CGGCTTCAAG AACGGCACCG ACGGCAACGT GAAGATCGCG
GTCGACGCGA TCAAGGCCGC ATCGCAGCCG CACCATTTCC TGTCGGTGAC GAAGGGCGAC
CATTCGGCGA TCGTGTCGAC GGCCGGCAAC GAGGACTGCC ACGTGATCCT GCGCGGCGGC
AAGGCGCCGA ACTACGATGC CGACAGCGTG AACGCCGCGT GCGCGGACAT CGGCAAGGCC
GGCCTCGCCG CGCGCCTGAT GATCGACGCG AGCCATGCGA ACAGCTCGAA GAAGCACGAG
AACCAGATTC CGGTATGCGC GGACATCGGC CGCCAGATCG CCGCGGGCGA CGAGCGCATC
GTCGGCGTGA TGGTCGAGTC GCACCTCGTC GAAGGCCGCC AGGACCTGAA GGAAGGCTGC
CCGCTCACGT ACGGCCAGAG CATCACCGAT GCATGCATCA ACTGGGACGA CAGCGTGAAG
GTGCTCGAAG GGCTCGCCGA AGCGGTGAAG GCGCGGCGCG TCGCGCGCGG CAGCGGCAAC
TGA
 
Protein sequence
MAREAQPNSR KTAGEPGGFF SHRRFASRDD GPIHRFTESP ICRIEPPAAP AHAGRRTEPE 
NSSMPPHNTD DVRIRELKEL TPPAHLIREF ALGEAVSELI YNARQAMHRI LHGMDDRLIV
IIGPCSIHDT KAALEYAGRL VQERERFASE LEIVMRVYFE KPRTTVGWKG LINDPHLDNS
FKINDGLRTA RELLLQINEM GLPAGTEYLD MISPQYIADL ISWGAIGART TESQVHRELA
SGLSCPVGFK NGTDGNVKIA VDAIKAASQP HHFLSVTKGD HSAIVSTAGN EDCHVILRGG
KAPNYDADSV NAACADIGKA GLAARLMIDA SHANSSKKHE NQIPVCADIG RQIAAGDERI
VGVMVESHLV EGRQDLKEGC PLTYGQSITD ACINWDDSVK VLEGLAEAVK ARRVARGSGN