Gene BBta_4018 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBBta_4018 
SymbolaroG 
ID5151767 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBradyrhizobium sp. BTAi1 
KingdomBacteria 
Replicon accessionNC_009485 
Strand
Start bp4222286 
End bp4223371 
Gene Length1086 bp 
Protein Length361 aa 
Translation table11 
GC content66% 
IMG OID640558849 
Productphospho-2-dehydro-3-deoxyheptonate aldolase 
Protein accessionYP_001239990 
Protein GI148255405 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase 
TIGRFAM ID[TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCTGAGCA CCACAGACGA TCTTCGTATC AAGGAAATCA AGGAATTGAG CCTGCCGGTG 
GAGGTGATGG GTGAGATCCC GCGCACCCTC ACCGCCACCC GCGTCGTGAT GGCCGCGCGC
AACGCGATCC ACGCCATCCT GAACGGCACT GACGATCGCC TGCTGGTCAT CGTCGGCCCC
TGCTCCATTC ACGATCCGAA GGCCGCGATC GATTATGCCG GCCGCCTCGC CGCCTTGCGC
GAGCGTCTCA ATGACCGGCT GGAAATCGTG ATGCGGGTCT ATTTCGAGAA GCCGCGCACC
ACGGTGGGCT GGAAGGGCCT GATCAACGAC CCCGATCTCA ACAATTCCTT CGACATCAAC
AAGGGCCTGC GGCTGGCGCG CAACGTGCTG TCCGCCGTCA ACAATCTCGG CCTGCCCGCG
GGATGCGAAT TCCTCGACAT GACAACGCCG CAATACATTG CCGACCTCGT CGCCTGGGCC
GCGATCGGCG CGCGCACCAC CGAGAGCCAG ATCCATCGCG AGCTGGCCTC CGGCCTGTCC
TGCCCGGTCG GCTTCAAGAA CGGCACCGAC GGCAACATCC GCATCGCCGC CGACGCGGTG
AAATCAGCCG CCCATCCGCA TCATTTCATG GCGGTCACCA AAGGCGGGCG CTCCGGCATC
GCGGCCACCA CCGGCAATGA AGACTGCCAC ATCATCCTGC GCGGCGGCAG CGGCCCGAAC
TACGACGCGG CCCATGTCGA GCAGGCCGCG AGCGAGCTGG TCAAGGCCGG CCTGCCGGCG
CGGCTGATGA TCGACACCAG CCACGCCAAT TCCAGCAAGA AGCCGGAGAA CCAGCCGCTC
GTCGCCGCAG ATATCGCCGG CCAGCTCGCG GCCGGCGAGC AGCGCATCAT GGGCGTGATG
ATCGAGAGCC ATCTCGTCGC CGGTCGCCAG GACGTCAAGC CGGGCGTGCC GCTGACCTAT
GGCCAGAGCA TCACCGACGG CTGTATCGGT TGGGACACGA CGGTCGCGGT GCTGGAGCAA
CTCGCCGACG CCGTCACCAC GCGGCGGACA CGGCGCAACG AGTCCGTGCG CGAGCGCTCG
GCGTAA
 
Protein sequence
MLSTTDDLRI KEIKELSLPV EVMGEIPRTL TATRVVMAAR NAIHAILNGT DDRLLVIVGP 
CSIHDPKAAI DYAGRLAALR ERLNDRLEIV MRVYFEKPRT TVGWKGLIND PDLNNSFDIN
KGLRLARNVL SAVNNLGLPA GCEFLDMTTP QYIADLVAWA AIGARTTESQ IHRELASGLS
CPVGFKNGTD GNIRIAADAV KSAAHPHHFM AVTKGGRSGI AATTGNEDCH IILRGGSGPN
YDAAHVEQAA SELVKAGLPA RLMIDTSHAN SSKKPENQPL VAADIAGQLA AGEQRIMGVM
IESHLVAGRQ DVKPGVPLTY GQSITDGCIG WDTTVAVLEQ LADAVTTRRT RRNESVRERS
A